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Preface 



We are fortunate to live in an age in which microelectronics still enjoy an accelerating 
growth in performance and complexity. Fortunate, since we are experiencing a remark- 
able progress in science, in communication technology, in our ability to acquire new 
knowledge, and in the many other wonderful amenities of modern society, all of which 
are permeated by and made possible by modern microelectronics. This exponential evolu- 
tionary trend, as described by Moore’s Law, has now lasted for more than three decades, 
and is still on track, fueled by a seemingly unending demand for ever better performance 
and by fierce global competition. 

A driving force behind this fantastic progress is the long-term commitment to a steady 
downscaling of MOSFET/CMOS technology needed to meet the requirements on speed, 
complexity, circuit density, and power consumption posed by the many advanced appli- 
cations relying on this technology. The degree of scaling is measured in terms of the 
half-pitch size of the first-level interconnect in DRAM technology, also termed the “tech- 
nology node” by the International Technology Roadmap for Semiconductors. At the time 
of the 2001 ITRS update, the technology node had reached 130 nm, while the smallest 
features, the MOSFET gate lengths, were a mere 65 nm. Within a decade, these numbers 
are expected to be close to 40 nm and 15 nm, respectively. 

Very important issues in this development are the increasing levels of complexity of 
the fabrication process and the many subtle mechanisms that govern the properties of deep 
submicrometer FETs. These mechanisms, dictated by device physics, have to be described 
and implemented into circuit design tools to empower the circuit designers with the ability 
to fully utilize the potential of existing and future technologies. 

Hence, circuit designers are faced with the relentless challenge of staying updated on 
the properties, potentials, and the limitations of the latest device technology and device 
models. This is especially true for designers of analog and radio frequency (RF) integrated 
circuits, where the sensitivity to the modeling details and the inteiplay between individual 
devices is more acute than for digital electronics. A deeper insight into these issues is 
therefore crucial for gaining the competitive edge needed to ensure first-time-right silicon 
and to reduce time-to-market for new products. 

Existing textbooks on analog and RF CMOS circuit design traditionally lack a thorough 
treatment of the device modeling challenges outlined above. Our primary objectives with 
the present book is to bridge the gap between device modeling and analog circuit design 
by presenting the state-of-the-art MOSFET models that are available in analog and SPICE- 
type circuit simulators today, together with related modeling issues of importance to both 
circuit designers and students, now and in the future. 
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This book is intended as a main or supplementary text for senior and graduate-level 
courses in analog integrated circuit design, as well as a reference and a text for self 
or group studies by practicing design engineers. Especially in student design projects, 
we foresee that this book will be a valuable handbook as well as a reference, both on 
basic modeling issues and on specific MOSFET models encountered in circuit simulators. 
Likewise, practicing engineers can use the book to enhance their insight into the principles 
of MOSFET operation and modeling, thereby improving their design skills. 

We assume that the reader already has a basic knowledge of common electronic 
devices and circuits, and fundamental concepts such as small-signal operation and equiv- 
alent circuits. 

The book is organized into twelve chapters. In Chapter 1, the reader is introduced 
to the basic physics, the principles of operation, and the modeling of MOS structures 
and MOSFETs. This chapter also discusses many of the issues that are important in the 
modeling of modem-day MOSFETs. Chapter 2 walks the reader through the fabrication 
steps of modem MOSFET and CMOS technology. In Chapter 3, the special concerns 
and the challenges of accurate modeling of MOSFETs operating at radio frequencies 
are discussed. Chapter 4 deals with modeling of noise in MOSFETs. Distortion analysis, 
discussed in Chapter 5, is of special concern for analog MOSFET circuit design. In 
Chapters 6, 7, and 8, we present the state-of-the-art MOSFET models that are commonly 
used by the analog design community today. The models covered are BSIM4, EKV, MOS 
Model 9 and MOSA1. These chapters are written in a reference style to provide quick 
lookup when the book is used like a handbook. Chapters 9 and 10 are devoted to the 
modeling of other devices that are of importance in typical analog CMOS circuits, such 
as bipolar transistors (Chapter 9) and passive devices, including resistors, capacitors, and 
inductors (Chapter 10). The remaining two chapters deal with essential industry-related 
issues of circuit design. Chapter 1 1 discusses the important topic of modeling of process 
variations and device mismatch effects and Chapter 12 deals with the quality assurance 
of the device models used by the design houses. 

The book is accompanied by two software application tools, AIM-Spice and MOSCalc. 
AIM-Spice is a version of SPICE with standard SPICE parameters, very familiar to many 
electrical engineers and electrical engineering students. Running under the Microsoft Win- 
dows family of operating systems, it takes full advantage of the available graphics user 
interface. The AIM-Spice software will run on all PCs equipped with Windows 95, 98, 
ME, NT 4, 2000, or XP. In addition to all the models included into Berkeley SPICE 
(Version 3e.l), AIM-Spice incorporates BSIM4, EKV, and MOSA1, which were cov- 
ered in Chapters 6, 7, and 8. A limited version of AIM-Spice can be downloaded from 
www.aimspice.com . The second tool, MOSCalc, is a Web-based calculator for rapid esti- 
mates of MOSFET large- and small-signal parameters. The designer enters the gate length 
and width, and a range of biasing voltages and/or the transistor currents, whereupon quan- 
tities such as gate overdrive voltage, effective threshold voltage, drain-source saturation 
voltage, all terminal currents, transconductance, channel conductance, and all small signal 
intrinsic capacitances are calculated. MOSCalc is available at ngl.fysel.ntnu.no. 

These dedicated software tools allow students to solve real engineering problems, which 
brings semiconductor device physics and modeling home to the user at a very practical 
level, bridging the gap between theory and practice. AIM-Spice and MOSCalc can be used 
routinely by practicing engineers during the design phase of analog integrated circuits. 
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1 

MOSFET Device Physics 
and Operation 



1.1 INTRODUCTION 

A field effect transistor (FET) operates as a conducting semiconductor channel with two 
ohmic contacts - the source and the drain - where the number of charge carriers in the 
channel is controlled by a third contact - the gate. In the vertical direction, the gate- 
channel-substrate structure (gate junction) can be regarded as an orthogonal two-terminal 
device, which is either a MOS structure or a reverse-biased rectifying device that controls 
the mobile charge in the channel by capacitive coupling (field effect). Examples of FETs 
based on these principles are metal-oxide-semiconductor FET (MOSFET), junction FET 
(JFET), metal-semiconductor FET (MESFET), and heterostructure FET (HFETs). In all 
cases, the stationary gate-channel impedance is very large at normal operating conditions. 
The basic FET structure is shown schematically in Figure 1.1. 

The most important FET is the MOSFET. In a silicon MOSFET, the gate contact 
is separated from the channel by an insulating silicon dioxide (Si02) layer. The charge 
carriers of the conducting channel constitute an inversion charge, that is, electrons in the 
case of a p - type substrate (n -channel device) or holes in the case of an n-type substrate 
(p-channel device), induced in the semiconductor at the silicon-insulator interface by the 
voltage applied to the gate electrode. The electrons enter and exit the channel at n + source 
and drain contacts in the case of an n-channel MOSFET, and at p 1 contacts in the case 
of a p-channel MOSFET. 

MOSFETs are used both as discrete devices and as active elements in digital and 
analog monolithic integrated circuits (ICs). In recent years, the device feature size of 
such circuits has been scaled down into the deep submicrometer range. Presently, the 
0.13-p.m technology node for complementary MOSFET (CMOS) is used for very large 
scale ICs (VLSIs) and, within a few years, sub-0. 1-p.m technology will be available, 
with a commensurate increase in speed and in integration scale. Hundreds of millions of 
transistors on a single chip are used in microprocessors and in memory ICs today. 

CMOS technology combines both n -channel and /^-channel MOSFETs to provide very 
low power consumption along with high speed. New silicon-on-insulator (SOI) technology 
may help achieve three-dimensional integration, that is, packing of devices into many 
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Semiconductor substrate 

I Substrate contact 

I 
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Figure 1.1 Schematic illustration of a generic field effect transistor. This device can be viewed 
as a combination of two orthogonal two-terminal devices 



layers, with a dramatic increase in integration density. New improved device structures 
and the combination of bipolar and field effect technologies (BiCMOS) may lead to 
further advances, yet unforeseen. One of the rapidly growing areas of CMOS is in analog 
circuits, spanning a variety of applications from audio circuits operating at the kilohertz 
(kHz) range to modem wireless applications operating at gigahertz (GHz) frequencies. 



1.2 THE MOS CAPACITOR 

To understand the MOSFET, we first have to analyze the MOS capacitor, which consti- 
tutes the important gate-channel-substrate structure of the MOSFET. The MOS capacitor 
is a two-terminal semiconductor device of practical interest in its own right. As indi- 
cated in Figure 1.2, it consists of a metal contact separated from the semiconductor by 
a dielectric insulator. An additional ohmic contact is provided at the semiconductor sub- 
strate. Almost universally, the MOS structure utilizes doped silicon as the substrate and 
its native oxide, silicon dioxide, as the insulator. In the silicon-silicon dioxide system, 
the density of surface states at the oxide -semiconductor interface is very low compared 
to the typical channel carrier density in a MOSFET. Also, the insulating quality of the 
oxide is quite good. 




Figure 1.2 Schematic view of a MOS capacitor 
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We assume that the insulator layer has infinite resistance, preventing any charge carrier 
transport across the dielectric layer when a bias voltage is applied between the metal and 
the semiconductor. Instead, the applied voltage will induce charges and counter charges 
in the metal and in the interface layer of the semiconductor, similar to what we expect in 
the metal plates of a conventional parallel plate capacitor. However, in the MOS capacitor 
we may use the applied voltage to control the type of interface charge we induce in the 
semiconductor - majority carriers, minority carriers, and depletion charge. 

Indeed, the ability to induce and modulate a conducting sheet of minority carriers at 
the semiconductor-oxide interface is the basis for the operation of the MOSFET. 



1.2.1 Interface Charge 

The induced interface charge in the MOS capacitor is closely linked to the shape of 
the electron energy bands of the semiconductor near the interface. At zero applied volt- 
age, the bending of the energy bands is ideally determined by the difference in the 
work functions of the metal and the semiconductor. This band bending changes with the 
applied bias and the bands become flat when we apply the so-called flat-band voltage 
given by 

V FB = (<*>m - <b t )/q = (d> m -X s -E c + E P )/q , (1.1) 

where <F m and <t> s are the work functions of the metal and the semiconductor, respectively, 
X s is the electron affinity for the semiconductor, E c is the energy of the conduction band 
edge, and E F is the Fermi level at zero applied voltage. The various energies involved 
are indicated in Figure 1.3, where we show typical band diagrams of a MOS capacitor 
at zero bias, and with the voltage V = V FB applied to the metal contact relative to the 
semiconductor-oxide interface. (Note that in real devices, the flat-band voltage may be 



Vacuum level 




Oxide 




(b) 



Figure 1.3 Band diagrams of MOS capacitor (a) at zero bias and (b) with an applied voltage 
equal to the flat-band voltage. The flat-band voltage is negative in this example 
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affected by surface states at the semiconductor-oxide interface and by fixed charges in 
the insulator layer.) 

At stationary conditions, no net current flows in the direction perpendicular to the 
interface owing to the very high resistance of the insulator layer (however, this does 
not apply to very thin oxides of a few nanometers, where tunneling becomes important, 
see Section 1.5). Hence, the Fermi level will remain constant inside the semiconductor, 
independent of the biasing conditions. However, between the semiconductor and the metal 
contact, the Fermi level is shifted by f?Fm - £fs — qV (see Figure 1.3(b)). Hence, we have 
a quasi-equilibrium situation in which the semiconductor can be treated as if in thermal 
equilibrium. 

A MOS structure with a p - type semiconductor will enter the accumulation regime of 
operation when the voltage applied between the metal and the semiconductor is more 
negative than the flat-band voltage (Vfb < 0 in Figure 1.3). In the opposite case, when 

V > Vfb> the semiconductor-oxide interface first becomes depleted of holes and we 
enter the so-called depletion regime. By increasing the applied voltage, the band bending 
becomes so large that the energy difference between the Fermi level and the bottom of 
the conduction band at the insulator- semiconductor interface becomes smaller than that 
between the Fermi level and the top of the valence band. This is the case indicated for 

V — OV in Figure 1.3(a). Carrier statistics tells us that the electron concentration then 
will exceed the hole concentration near the interface and we enter the inversion regime. 
At still larger applied voltage, we finally arrive at a situation in which the electron volume 
concentration at the interface exceeds the doping density in the semiconductor. This is 
the strong inversion case in which we have a significant conducting sheet of inversion 
charge at the interface. 

The symbol i J/ is used to signify the potential in the semiconductor measured relative 
to the potential at a position x deep inside the semiconductor. Note that t// becomes 
positive when the bands bend down, as in the example of a p-type semiconductor shown 
in Figure 1.4. From equilibrium electron statistics, we find that the intrinsic Fermi level 
E[ in the bulk corresponds to an energy separation qtp b from the actual Fermi level E\ 
of the doped semiconductor, 




Figure 1.4 Band diagram for MOS capacitor in weak inversion (<p b < \jr s < 2 <p b ) 
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where V th is the thermal voltage, V a is the shallow acceptor density in the p-type semicon- 
ductor and /?i is the intrinsic carrier density of silicon. According to the usual definition, 
strong inversion is reached when the total band bending equals 2 qcpb, corresponding to the 
surface potential i// s = 2 <p b . Values of the surface potential such that 0 < x[r s < 2 <p b corre- 
spond to the depletion and the weak inversion regimes, i/f s = 0 is the flat-band condition, 
and xj/ s < 0 corresponds to the accumulation mode. 

The surface concentrations of holes and electrons are expressed in terms of the surface 
potential as follows using equilibrium statistics, 

Ps = N a exp(— V's/Vth), (1-3) 

n s = n\/p s — n po exp (i/^/ V*), (1-4) 

where « po = nf/N a is the equilibrium concentration of the minority carriers (electrons) 
in the bulk. 

The potential distribution \jr(x) in the semiconductor can be determined from a solution 
of the one-dimensional Poisson’s equation: 

d 2 x[r(x) _ p(x) 

da 2 ~ s s ( ' ’ 

where e s is the semiconductor permittivity, and the space charge density p(x) is given by 

p(x) = q(p - n - V a ). (1.6) 

The position-dependent hole and electron concentrations may be expressed as 

p = A f a exp(-i/'/Vth), (1.7) 

n = n po exp(Vf/V th ). (1.8) 



Note that deep inside the semiconductor, we have i// ( oo ) — 0. 

In general, the above equations do not have an analytical solution for i J/(x). How- 
ever, the following expression can be derived for the electric field F s at the insula- 
tor-semiconductor interface, in terms of the surface potential (see, e.g., Fjeldly el al. 
1998), 



F s = s/2— f ( — 

L d p \V th 



(1.9) 



where the function / is defined by 



/(«) = ± /[exp(-w) + m 



l] + -^[exp( M ) 



1 ], 



( 1 . 10 ) 



and 




( 1 . 11 ) 



is called the Debye length. In (1.10), a positive sign should be chosen for a positive \jj. 
and a negative sign corresponds to a negative i/A- 
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Using Gauss’ law, we can relate the total charge Q s per unit area (carrier charge and 
depletion charge) in the semiconductor to the surface electric field by 

Qs — —s s F s . (1.12) 

At the flat-band condition ( V = Vfb). the surface charge is equal to zero. In accumulation 
( V < Vfb). the surface charge is positive, and in depletion and inversion ( V > Vfb), the 
surface charge is negative. In accumulation (when |i/f s | exceeds a few times V*) and 
in strong inversion, the mobile sheet charge density is proportional to exp[|i/ f s |/(2V t h)]). 
In depletion and weak inversion, the depletion charge is dominant and its sheet density 
varies as i// s l/2 . Figure 1.5 shows |£? s | versus i// s for p-typc silicon with a doping density 
of 10 16 /cm 3 . 

In order to relate the semiconductor surface potential to the applied voltage V, we 
have to investigate how this voltage is divided between the insulator and the semicon- 
ductor. Using the condition of continuity of the electric flux density at the semiconduc- 
tor-insulator interface, we find 

s s F s = s i F i , (1.13) 

where s, is the permittivity of the oxide layer and F, is the constant electric field in the 
insulator (assuming no space charge). Hence, with an insulator thickness d,, the voltage 
drop across the insulator becomes Fid\. Accounting for the flat-band voltage, the applied 
voltage can be written as 

V = Vfb + fs + e t F t /c u (1.14) 

where Ci = 8,/d, is the insulator capacitance per unit area. 




Figure 1.5 Normalized total semiconductor charge per unit area versus normalized surface potential 
for p-type Si with N- d = 10 16 /cm 3 . Q th = (2 s s qN d V th ) 1/2 « 9.3 x 10~ 9 C/cm 2 and V th « 0.026 V at 
T = 300 K. The arrows indicate flat-band condition and onset of strong inversion 
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1.2.2 Threshold Voltage 

The threshold voltage V — Vt, corresponding to the onset of the strong inversion, is one 
of the most important parameters characterizing metal-insulator-semiconductor devices. 
As discussed above, strong inversion occurs when the surface potential \jr & becomes equal 
to 2 (pb. For this surface potential, the charge of the free carriers induced at the insula- 
tor-semiconductor interface is still small compared to the charge in the depletion layer, 
which is given by 

GdT = -qN^ddj = —yj As s qN a (p b , (1.15) 

where d^i — (4 Sstpb/qNf) 112 is the width of the depletion layer at threshold. Accordingly, 
the electric held at the semiconductor-insulator interface becomes 

^sT = -GdT/£s = y/AqNa.fpb/Es. (1.16) 

Hence, substituting the threshold values of i/'s and F s in (1.14), we obtain the following 
expression for the threshold voltage: 

Vt = F F b + 2 <p b + y%^A^/ci. (1.17) 

Figure 1.6 shows typical calculated dependencies of V T on doping level and dielec- 
tric thickness. 

For the MOS structure shown in Figure 1.2, the application of a bulk bias Vb is simply 
equivalent to changing the applied voltage from V to V — Vb- Hence, the threshold 




Figure 1.6 Dependence of MOS threshold voltage on the substrate doping level for different 
thicknesses of the dielectric layer. Parameters used in calculation: energy gap, 1.12eV; effec- 
tive density of states in the conduction band, 3.22 x 10 25 /m 3 ; effective density of states in the 
valence band, 1.83 x 10 25 /m 3 ; semiconductor permittivity, 1.05 x 10~ 10 F/m; insulator permittivity, 
3.45 x 10~ u F/m; flat-band voltage, —IV; temperature: 300 K. Reproduced from Lee K., Shur M., 
Fjeldly T. A., and Ytterdal T. (1993) Semiconductor Device Modeling for VLSI . Prentice Hall. 
Englewood Cliffs, NJ 
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referred to the ground potential is simply shifted by V B . However, the situation will be 
different in a MOSFET where the conducting layer of mobile electrons may be maintained 
at some constant potential. Assuming that the inversion layer is grounded, V B biases the 
effective junction between the inversion layer and the substrate, changing the amount of 
charge in the depletion layer. In this case, the threshold voltage becomes 



Vj = Vf B + 2 (pi, + ^2s s qN a (2<pb — V B )/ci. (1.18) 

Note that the threshold voltage may also be affected by so-called fast surface states at 
the semiconductor-oxide interface and by fixed charges in the insulator layer. However, 
this is not a significant concern with modern day fabrication technology. 

As discussed above, the threshold voltage separates the subthreshold regime, where 
the mobile carrier charge increases exponentially with increasing applied voltage, from 
the above-threshold regime, where the mobile carrier charge is linearly dependent on the 
applied voltage. However, there is no clear point of transition between the two regimes, so 
different definitions and experimental techniques have been used to determine Vj. Some- 
times (1.17) and (1.18) are taken to indicate the onset of so-called moderate inversion, 
while the onset of strong inversion is defined to be a few thermal voltages higher. 



1.2.3 MOS Capacitance 



In a MOS capacitor, the metal contact and the neutral region in the doped semiconductor 
substrate are separated by the insulator layer, the channel, and the depletion region. Hence, 
the capacitance C mos of the MOS structure can be represented as a series connection of 
the insulator capacitance C\ — Ss, /d \ , where S is the area of the MOS capacitor, and the 
capacitance of the active semiconductor layer C s , 

C,C S 

C mos = — !-i-. (1.19) 

Ci + c s 

The semiconductor capacitance can be calculated as 



C s = S 



dg s 

diAs 



( 1 . 20 ) 



where Q s is the total charge density per unit area in the semiconductor and i// s is the surface 
potential. Using (1.9) to (1.12) for Q s and performing the differentiation, we obtain 



C - 



c« 



V2/0As/Vth) 



1 1 — exp 




Hpo 

N„ 





(1.21) 



Here, C so = Ss s /L Dp is the semiconductor capacitance at the flat-band condition (i.e., 
for i/f s = 0) and L Dp is the Debye length given by (1.11). Equation (1.14) describes the 
relationship between the surface potential and the applied bias. 

The semiconductor capacitance can formally be represented as the sum of two capaci- 
tances - a depletion layer capacitance Cd and a free carrier capacitance Cf c . Cf c together 
with a series resistance R G R describes the delay caused by the generation/recombination 
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mechanisms in the buildup and removal of inversion charge in response to changes in the 
bias voltage (see following text). The depletion layer capacitance is given by 



C d = Ss s /d A , (1.22) 

where 




is the depletion layer width. In strong inversion, a change in the applied voltage will pri- 
marily affect the minority carrier charge at the interface, owing to the strong dependence 
of this charge on the surface potential. This means that the depletion width reaches 
a maximum value with no significant further increase in the depletion charge. This 
maximum depletion width d dT can be determined from (1.23) by applying the thresh- 
old condition, i// s = 2 <p b . The corresponding minimum value of the depletion capacitance 
is C d x = Ss & /d d j. 

The free carrier contribution to the semiconductor capacitance can be formally ex- 
pressed as 

Cfc = C s -C d . (1.24) 

As indicated, the variation in the minority carrier charge at the interface comes from the 
processes of generation and recombination mechanisms, with the creation and removal of 
electron-hole pairs. Once an electron-hole pair is generated, the majority carrier (a hole 
in p- type material and an electron in n-type material) is swept from the space charge 
region into the substrate by the electric field of this region. The minority carrier is swept 
in the opposite direction toward the semiconductor-insulator interface. The variation in 
minority carrier charge at the semiconductor-insulator interface therefore proceeds at a 
rate limited by the time constants associated with the generation/recombination processes. 
This finite rate represents a delay, which may be represented electrically in terms of an 
RC product consisting of the capacitance Cf c and the resistance Rqr, as reflected in the 
equivalent circuit of the MOS structure shown in Figure 1.7. The capacitance Cf c becomes 
important in the inversion regime, especially in strong inversion where the mobile charge 
is important. The resistance R s in the equivalent circuit is the series resistance of the 
neutral semiconductor layer and the contacts. 



C d 




Figure 1.7 Equivalent circuit of the MOS capacitor. Reproduced from Shur M. (1990) Physics 
of Semiconductor Devices, Prentice Hall, Englewood Cliffs, NJ 
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This equivalent circuit is clearly frequency-dependent. In the low-frequency limit, we 
can neglect the effects of Rqr and R s to obtain (using C s = C c i + Cf c ) 



r° 

mos 



c s c j 

C s + C\ 



In strong inversion, we have C s 5>> Cj, which gives 



C° 

mos 



Ci 



(1.25) 



(1.26) 



at low frequencies. 

In the high-frequency limit, the time constant of the generation/recombination mech- 
anism will be much longer than the signal period (/? GR Cf c 5 s> 1//) and Q effectively 
shunts the lower branch of the parallel section of the equivalent in Figure 1.7. Hence, the 
high-frequency, strong inversion capacitance of the equivalent circuit becomes 



C 



oo 

mos 



C dT C i 
CdT + Ci' 



(1.27) 



The calculated dependence of C mos on the applied voltage for different frequencies is 
shown in Figure 1.8. For applied voltages well below threshold, the device is in accumu- 
lation and C mos equals C\. As the voltage approaches threshold, the semiconductor passes 
the flat-band condition where C mos has the value Cfb, and then enters the depletion and 
the weak inversion regimes where the depletion width increases and the capacitance value 
drops steadily until it reaches the minimum value at threshold given by (1.27). The cal- 
culated curves clearly demonstrate how the MOS capacitance in the strong inversion 
regime depends on the frequency, with a value of C“ s at high frequencies to C l at low 
frequencies. 




Applied voltage 



Figure 1.8 Calculated dependence of C mos on the applied voltage for different frequencies. Param- 
eters used: insulator thickness, 2 x 10~ 8 m; semiconductor doping density, 10 15 /cm 3 ; generation 
time, 10“ 8 s. Reproduced from Shur M. (1990) Physics of Semiconductor Devices, Prentice Hall, 
Englewood Cliffs, NJ 
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We note that in a MOSFET, where the highly doped source and drain regions act 
as reservoirs of minority carriers for the inversion layer, the time constant RqrC f C must 
be substituted by a much smaller time constant corresponding to the time needed for 
transporting carriers from these reservoirs in and out of the MOSFET gate area. Conse- 
quently, high-frequency strong inversion MOSFET gate-channel C-V characteristics will 
resemble the zero frequency MOS characteristic. 

Since the low-frequency MOS capacitance in the strong inversion is close to Q, the 
induced inversion charge per unit area can be approximated by 



qn s « a(V - V T ). (1.28) 

This equation serves as the basis of a simple charge control model (SCCM) allowing us 
to calculate MOSFET current -voltage characteristics in strong inversion. 

From measured MOS C-V characteristics, we can easily determine important param- 
eters of the MOS structure, including the gate insulator thickness, the semiconductor 
substrate doping density, and the flat-band voltage. The maximum measured capacitance 
Cmax (capacitance C, in Figure 1.7) yields the insulator thickness 

d\ ~ SeJ C max . (1.29) 

The minimum measured capacitance C m j n (at high frequency) allows us to find the 
doping concentration in the semiconductor substrate. First, we determine the depletion 
capacitance in the strong inversion regime using (1.27), 



1/ C min = 1/ C dT + 1/Cj. 



(1.30) 



From Cdx we obtain the thickness of the depletion region at threshold as 



d A j — Ss s / CdT- 



(1.31) 



Then we calculate the doping density N. d using (1.23) with i// s = 2<p b and (1.2) for i p b . 
This results in the following transcendental equation for /V a : 



iVa = 



4£ s Vth 

qdd7 



In 




(1.32) 



This equation can easily be solved by iteration or by approximate analytical techniques. 
Once d[ and N a have been obtained, the device capacitance Cfb under flat-band condi- 
tions can be determined using C s = C so ((1.21) at flat-band condition) in combination 
with (1.19): 



Cfb 



CsoQ 



Cso + Ci 



Ss s Sj 

£s^i “b ei^Dp 



(1.33) 



The flat-band voltage Vfb is simply equal to the applied voltage corresponding to this 
value of the device capacitance. 

We note that the above characterization technique applies to ideal MOS structures. 
Different nonideal effects, such as geometrical effects, nonuniform doping in the substrate. 
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interface states, and mobile charges in the oxide may influence the C—V characteristics 
of the MOS capacitor. 



1.2.4 MOS Charge Control Model 

Well above threshold, the charge density of the mobile carriers in the inversion layer can 
be calculated using the parallel plate charge control model of (1.28). This model gives 
an adequate description for the strong inversion regime of the MOS capacitor, but fails 
for applied voltages near and below threshold (i.e., in the weak inversion and depletion 
regimes). Several expressions have been proposed for a unified charge control model 
(UCCM) that covers all the regimes of operation, including the following (see Byun 
et al. 1990): 

V - V T = q(n s - n 0 )/c a + qV th In , (1.34) 

where c a ~ Cj is approximately the insulator capacitance per unit area (with a small 
correction for the finite vertical extent of the inversion channel, see Lee et al. (1993)), 
ii q — n s (V — Vf) is the density of minority carriers per unit area at threshold, and tj is the 
so-called subthreshold ideality factor, also known as the subthreshold swing parameter. 
The ideality factor accounts for the subthreshold division of the applied voltage between 
the gate insulator and the depletion layer, and I !tj represents the fraction of this voltage 
that contributes to the interface potential. A simplified analysis gives 



h = 1 + C d /Ci, (1.35) 

n 0 = nVthC a /2q. (1.36) 




(V- V T )/ V v th 



Figure 1.9 Comparison of various charge control expression for the MOS capacitor. Equ- 
ation (1.38) is a close approximation to (1.34), while the above- and below-threshold approxi- 
mations are given by (1.28) and (1.37), respectively. Reproduced from Fjeldly T. A., Ytterdal T., 
and Shur M. (1998) Introduction to Device Modeling and Circuit Simulation, John Wiley & Sons, 
New York 
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In the subthreshold regime, (1.34) approaches the limit 



n s = n Q exp 



V - V T 



(1.37) 



We note that (1.34) does not have an exact analytical solution for the inversion charge 
in terms of the applied voltage. However, for many purposes, the following approximate 
solution may be suitable: 



n s = 2 n 0 In 



' ! fV-V T 

1 + t exp — — 
2 \ 



(1.38) 



This expression reproduces the correct limiting behavior both in strong inversion and 
in the subthreshold regime, although it deviates slightly from (1.34) near threshold. The 
various charge control expressions of the MOS capacitor are compared in Figure 1.9. 



1.3 BASIC MOSFET OPERATION 

In the MOSFET, an inversion layer at the semiconductor-oxide interface acts as a con- 
ducting channel. For example, in an n -channel MOSFET, the substrate is p-typc silicon 
and the inversion charge consists of electrons that form a conducting channel between 
the n + ohmic source and the drain contacts. At DC conditions, the depletion regions and 
the neutral substrate provide isolation between devices fabricated on the same substrate. 
A schematic view of the ^-channel MOSFET is shown in Figure 1.10. 

As described above for the MOS capacitor, inversion charge can be induced in the 
channel by applying a suitable gate voltage relative to other terminals. The onset of 
strong inversion is defined in terms of a threshold voltage Vj being applied to the gate 
electrode relative to the other terminals. In order to assure that the induced inversion 
channel extends all the way from source to drain, it is essential that the MOSFET gate 
structure either overlaps slightly or aligns with the edges of these contacts (the latter is 
achieved by a self-aligned process). Self-alignment is preferable since it minimizes the 
parasitic gate-source and gate-drain capacitances. 



Gate 



Source 



Drain 




boundary 

/ 



Substrate contact 



Figure 1.10 Schematic view of an «-channel MOSFET with conducting channel and deple- 
tion region 
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When a drain-source bias Vds is applied to an n -channel MOSFET in the above- 
threshold conducting state, electrons move in the channel inversion layer from source to 
drain. A change in the gate-source voltage Vgs alters the electron sheet density in the 
channel, modulating the channel conductance and the device current. For V GS > Vj in 
an n -channel device, the application of a positive Vds gives a steady voltage increase 
from source to drain along the channel that causes a corresponding reduction in the local 
gate-channel bias Vqx (here X signifies a position x within the channel). This reduction 
is greatest near drain where Vox equals the gate-drain bias V G d- 

Somewhat simplistically, we may say that when V GD = V T , the channel reaches thresh- 
old at the drain and the density of inversion charge vanishes at this point. This is the 
so-called pinch-off condition, which leads to a saturation of the drain current / ( | S . The 
corresponding drain-source voltage, Vos = Vsat, is called the saturation voltage. Since 
V GD = Vgs — Vds, we find that V S at = V G s — Vf. (This is actually a result of the SCCM, 
which is discussed in more detail in Section 1.4.1.) 

When Vds > Vsat, the pinched-off region near drain expands only slightly in the 
direction of the source, leaving the remaining inversion channel intact. The point of 
transition between the two regions, x = x p , is characterized by Vxs (x p ) & Vsat, where 
Vxs (x p ) is the channel voltage relative to source at the transition point. Flence, the drain 
current in saturation remains approximately constant, given by the voltage drop Vsat 
across the part of the channel that remain in inversion. The voltage Vds — Vsat across the 
pinched-off region creates a strong electric field, which efficiently transports the electrons 
from the strongly inverted region to the drain. 

Typical current- voltage characteristics of a long-channel MOSFET, where pinch-off is 
the predominant saturation mechanism, are shown in Figure 1.11. Flowever, with shorter 
MOSFET gate lengths, typically in the submicrometer range, velocity saturation will 
occur in the channel near drain at lower Vds than that causing pinch-off. This leads to 
more evenly spaced saturation characteristics than those shown in this figure, more in 




Figure 1.11 Current-voltage characteristics of an /(-channel MOSFET with current saturation 
caused by pinch-off (long-channel case). The intersections with the dotted line indicate the onset 
of saturation for each characteristic. The threshold voltage is assumed to be Vp = 1 V. Reproduced 
from Fjeldly T. A., Ytterdal T., and Shur M. (1998) Introduction to Device Modeling and Circuit 
Simulation, John Wiley & Sons, New York 
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agreement with those observed for modern devices. Also, phenomena such as a finite 
channel conductance in saturation, a drain bias-induced shift in the threshold voltage, 
and an increased subthreshold current are important consequences of shorter gate lengths 
(see Section 1.5). 



1.4 BASIC MOSFET MODELING 

Analytical or semianalytical MOSFET models are usually based on the so-called grad- 
ual channel approximation (GCA). Contrary to the situation in the ideal two-terminal 
MOS device, where the charge density profile is determined from a one-dimensional 
Poisson’s equation (see Section 1.2), the MOSFET generally poses a two-dimensional 
electrostatic problem. The reason is that the geometric effects and the application of a 
drain-source bias create a lateral electric field component in the channel, perpendicu- 
lar to the vertical field associated with the ideal gate structure. The GCA states that, 
under certain conditions, the electrostatic problem of the gate region can be expressed 
in terms of two coupled one-dimensional equations - a Poisson’s equation for determin- 
ing the vertical charge density profile under the gate and a charge transport equation 
for the channel. This allows us to determine self-consistently both the channel poten- 
tial and the charge profile at any position along the gate. A direct inspection of the 
two-dimensional Poisson’s equation for the channel region shows that the GCA is valid 
if we can assume that the electric field gradient in the lateral direction of the chan- 
nel is much less than that in the vertical direction perpendicular to the channel (Lee 
et al. 1993). 

Typically, we find that the GCA is valid for long-channel MOSFETs, where the ratio 
between the gate length and the vertical distance of the space charge region from the 
gate electrode, the so-called aspect ratio, is large. However, if the MOSFET is biased in 
saturation, the GCA always becomes invalid near drain as a result of the large lateral field 
gradient that develops in this region. In Figure 1.12, this is schematically illustrated for 
a MOSFET in saturation. 

Next, we will discuss three relatively simple MOSFET models, the simple charge 
control model, the Meyer model, and the velocity saturation model. These models, with 
extensions, can be identified with the models denoted as MOSFET Level 1, Level 2, and 
Level 3 in SPICE. 



Gate 




Substrate contact 



Figure 1.12 Schematic representation of a MOSFET in saturation, where the channel is divided 
into a nonsaturated region where the GCA is valid and a saturated region where the GCA is invalid 
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We should note that the analysis that follows is based on idealized device structures. 
Especially in modern MOSFET/CMOS technology, optimized for high-speed and low- 
power applications, the devices are more complex. Additional oxide and doping regions 
are used for the purpose of controlling the threshold voltage and to avoid deleterious 
effects of high electric fields and so-called short- and narrow-channel phenomena asso- 
ciated with the steady downscaling device dimensions. These effects will be discussed 
more in Section 1.5 and in later chapters. 



1.4.1 Simple Charge Control Model 

Consider an n -channel MOSFET operating in the above-threshold regime, with a gate 
voltage that is sufficiently high to cause inversion in the entire length of the channel at 
zero drain-source bias. We assume a long-channel device, implying that GCA is applicable 
and that the carrier mobility can be taken to be constant (no velocity saturation). As a 
hrst approximation, we can describe the mobile inversion charge by a simple extension 
of the parallel plate expression (1.28), taking into account the potential variation V(x) 
along the channel, that is, 

qn s (x) « CitVoT - V(x)], (1.39) 



where Vgt = Cos — Vj. This simple charge control expression implies that the variation 
of the depletion layer charge along the channel, which depends on V(x), is negligible. 
Furthermore, since the expression relies on GCA, it is only applicable for the nonsaturated 
part of the channel. Saturation sets in when the conducting channel is pinched-off at the 
drain side, that is, for n s (x — L) > 0. Using the pinch-off condition and V (x — L) — 
Vds in (1-39), we obtain the following expression for the saturation drain voltage in 
the SCCM: 



Us at = Vgt. 



(1.40) 



The threshold voltage in this model is given by (1.18), where we have accounted for 
the substrate bias Vrs relative to the source. We note that this expression is only valid for 
negative or slightly positive values of Vbs, when the junction between the source contact 
and the /^-substrate is either reverse-biased or slightly forward-biased. For high Vrs, a 
significant leakage current will take place. 

Figure 1.13 shows an example of calculated dependences of the threshold voltage 
on substrate bias for different values of gate insulator thickness. As can be seen from 
this figure and from (1.18), the threshold voltage decreases with decreasing insulator 
thickness and is quite sensitive to the substrate bias. This so-called body effect is essential 
for device characterization and in threshold voltage engineering. For real devices, it is 
important to be able to carefully adjust the threshold voltage to match specific application 
requirements. 

Equation (1.18) also shows that Vj can be adjusted by changing the doping or by using 
different gate metals (including heavily doped polysilicon). As discussed in Section 1.2, 
the gate metal affects the flat-band voltage through the work-function difference between 
the metal and the semiconductor. Threshold voltage adjustment by means of doping is 
often performed with an additional ion implantation through the gate oxide. 
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Figure 1.13 Body plot, the dependence of the threshold voltage on substrate bias in MOSFETs 
with different insulator thicknesses. Parameters used in the calculation: flat-band voltage —IV, 
substrate doping density 10 22 /m 3 , temperature 300 K. The slope of the plots are given in terms of 
the body-effect parameter y = (2s s qN d ) l ^ 2 /ci. Reproduced from Fjeldly T. A., Ytterdal T., and 
Shur M. (1998) Introduction to Device Modeling and Circuit Simulation, lohn Wiley & Sons, 
New York 

Assuming a constant electron mobility /r„, the electron velocity can be written as 
v„ — —ju„ d V/dx. Neglecting the diffusion current, which is important only near threshold 
and in the subthreshold regime, the absolute value of the drain current can be written as 

Ids = Wn„qn s F, (1.41) 

where F = |dV/d.r| is the magnitude of the electric field in the channel and W is the 
channel width. Integrating this expression over the gate length and using the fact that 7d S 
is independent of position x, we obtain the following expression for the current-voltage 
characteristics: 

W fi n Ci \ (Vgt — Vds/2) Vds, f° r Vds < Vsat = Vgt 

* Kt/ 2. for Vds > Vsat ' <1A2> 

As implied above, the pinch-off condition implies a vanishing carrier concentration at 
the drain side of the channel. Hence, at a first glance, one might think that the drain current 
should also vanish. However, instead the saturation drain current 7d sa t is determined by the 
resistance of nonsaturated part of the channel and the current across it. In fact, this channel 
resistance changes very little when V DS increases beyond Vsat, since the pinch-off point 
x p moves only slightly away from the drain, leaving the nonsaturated part of the channel 
almost intact. Moreover, the voltage at the pinch-off point will always be approximately 
Vsat since the threshold condition at x p is determined by Vq — V (x p ) = Vj, or V (x p ) = 
V GT = Vsat- Hence, since the resistance of the nonsaturated part is constant and the 
voltage across it is constant, 7d sa t will also remain constant. Therefore, the saturation 
current 7sat is determined by substituting Vds = Vsat from (1.40) into the nonsaturation 
expression in (1.42). In reality, of course, the electron concentration never vanishes, nor 
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does the electric field become infinite. This is simply a consequence of the breakdown 
of GCA near drain in saturation, pointing to the need for a more accurate and detailed 
analysis of the saturation regime. 

The MOSFET current- voltage characteristics shown in Figure 1.11 were calculated 
using this simple charge control model. 

Important device parameters are the channel conductance, 



§d 



dh 
9 Vds 



Vgs 



\P(V GT — ^Ds). for ^DS < f^SAT 
jo, for Eds > Vsat 



and the transconductance, 



9/d 

8m 9 Vgs 



^DS 



| P Vds , for Vds < Vsat 
1 P Vgt, for Vds > Vsat 



(1.43) 



(1.44) 



where p = W p „ c\ / L is called the transconductance parameter. As can be seen from these 
expressions, high values of channel conductance and transconductance are obtained for 
large electron mobilities, large gate insulator capacitances (i.e., thin gate insulator layers), 
and large gate width to length ratios. 

The SCCM was developed at a time when the MOSFET gate lengths were typically 
tens of micrometers long, justifying some of the above approximations. With today’s deep 
submicron technology, however, the SCCM is clearly not applicable. We therefore intro- 
duce two additional models that include significant improvements. In the first of these, 
the Meyer model, the lateral variation of the depletion charge in the channel is taken into 
account. In the second, the velocity saturation model (VSM), we introduce the effects of 
saturation in the carrier velocity. The former is important at realistic levels of substrate 
doping, and the latter is important because of the high electric fields generated in short- 
channel devices. Additional effects of small dimensions and high electric fields will be 
discussed in Section 1.5. 



1.4.2 The Meyer Model 

The total induced charge q s per unit area in the semiconductor of an n -channel MOSFET, 
including both inversion and depletion charges, can be expressed in terms of Gauss’s law 
as follows, assuming that the source and the semiconductor substrate are both connected 
to ground (see Section 1.2), 

q s = -c[V G s - V FB - 2 <p h - V (x)]. (1.45) 

Here, the content of the bracket expresses the voltage drop across the insulator layer. 
The induced sheet charge density includes both the inversion charge density q\ = — qn s 
and the depletion charge density qd, that is, q s = cp + q d . Using (1.15) and including the 
added channel-substrate bias caused by the channel voltage, the depletion charge per unit 
area can be expressed as 



q d = -qN a d A = -^2s & qN^[2(p b + V(x)], 



(1.46) 
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where d& is the local depletion layer width at position x. Hence, the inversion sheet charge 
density becomes 



q\ = —qn s = -Ci[V G s - V F B -2 <p b - V (x)\ + y/2s s qN a [2(p b + V(x)\. (1.47) 

A constant electron mobility is also assumed in the Meyer model. Hence, the nonsat- 
urated drain current can again be obtained by substituting the expression for n s in 

7ds = W/L„qn s (x)F(x) (1.48) 



to give (Meyer 1971) 



7ds — 



WUnCi 



L 

2^/2s s qN. 

3g 



Vds 

Vgs — Vfb — 2i fh — ) Vds 



[(Vds + 2 (pb) 3 ^ — (2 < ] 



The saturation voltage is obtained using the pinch-off condition n & = 0, 

s & qN a 



Vsat = Vgs — 2 <p b — Vfb + 



. _ / , 2c 1 2 (V G s — Vfb)~ 

V £sqN a 



(1.49) 



(1.50) 



At low doping levels, we see that Vsat approaches Vqt, which is the result found for the 
simple charge control model. 



1.4.3 Velocity Saturation Model 

The linear velocity-held relationship (constant mobility) used in the above MOSFET 
models works reasonably well for long-channel devices. However, the implicit notion of 
a diverging carrier velocity as we approach pinch-off is, of course, unphysical. Instead, 
current saturation is better described in terms of a saturation of the carrier drift velocity 
when the electric held near drain becomes sufficiently high. The following two-piece 
model is a simple, hrst approximation to a realistic velocity-held relationship: 



v(F) = 



\dnF 

It’s 



for F < F s 
for F > F s ’ 



(1.51) 



where F = d V (x ) /dv is the magnitude of lateral electrical held in the channel, v s is 
the saturation velocity, and F s — t> s //x„ is the saturation held. In this description, current 
saturation in FETs occurs when the held at the drain side of the gate reaches the saturation 
held. A somewhat more precise expression, which is particularly useful for n -channel 
MOSFETs, is the so-called Sodini model (Sodini et al. 1984), 



jj.fi F 



v(F) = 



1 + F/2F, 

v s 



for F < 2 F & 
for F > 2 F s 



(1.52) 
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Figure 1.14 Velocity-field relationships for charge carriers in silicon MOSFETs. The electric 
field and the velocity are normalized to F s and v s , respectively. Two of the curves are calculated 
from (1.53) using m = 1 for holes and m = 2 for electrons. The curve marked m = oo corresponds 
to the linear two-piece model in (1.51). The Sodini model (1.52) is also shown 



Even more realistic velocity-field relationships for MOSFETs are obtained from 



v{F) = 



IJ-F 

[1 + (F/F s ) m ]V'" 



(1.53) 



where m = 2 and m — 1 are reasonable choices for n -channel and /t-channel MOSFETs, 
respectively. The two-piece model in (1.51) corresponds to m = oo in (1.53). Figure 1.14 
shows different velocity-field models for electrons and holes in silicon MOSFETs. 

Using the simple velocity-field relationship of (1.51), current -voltage characteristics 
can easily be derived from either the SCCM or the Meyer model, since the form of 
the nonsaturated parts of the characteristics will be the same as before (see (1.42) and 
(1.49)). However, the saturation voltage will now be identical to the drain-source voltage 
that initiates velocity saturation at the drain side of the channel. In terms of (1.51), this 
occurs when F(L) — F s . Hence, using this condition in combination with the SCCM, we 
obtain the following expressions for the drain current and the saturation voltage: 



W/J-nCj ^ | VqtFds ~ Ups /2, for Vos < Usat 

L 1(Vgt — Vsat)Fl, for Eds > Vsat 



Fsat — Fgt — Vl 



\A + (Vqt/ Vl) 



(1.54) 

(1.55) 



where Vl = F S L — Lv s /fi n . The Meyer VSM leads to a much more complicated rela- 
tionship for Vs at- 

For large values of Vl such that Vl Vgt, the square root terms in (1.55) may be 
expanded into a Taylor series, yielding the previous long-channel result for the SCCM 
without velocity saturation. Assuming, as an example, that Vgt = 3 V, /r„ = 0.08 m 2 /Vs, 
and v s — 1 x 10 5 m/s, we find that velocity saturation effects may be neglected for L 5>> 
2.4 |xm. Hence, velocity saturation is certainly important in modern MOSFETs with gate 
lengths typically in the deep submicrometer range. 
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In the opposite limiting case, when V\ <£ V GX , we obtain V S at ^ Vl and /dsat ~ 
PVlVqt- Since 7d sa t is proportional to V GX in long-channel devices and proportional to 
Vgt in short-channel devices, we can use this difference to identify the presence of 
short-channel effects on the basis of measured device characteristics. 

1.4.4 Capacitance Models 

For the simulation of dynamic events in MOSFET circuits, we also have to account for 
variations in the stored charges of the devices. In a MOSFET, we have stored charges in 
the gate electrode, in the conducting channel, and in the depletion layers. Somewhat sim- 
plified, the variation in the stored charges can be expressed through different capacitance 
elements, as indicated in Figure 1.15. 

We distinguish between the so-called parasitic capacitive elements and the capacitive 
elements of the intrinsic transistor. The parasitics include the overlap capacitances between 
the gate electrode and the highly doped source and drain regions (C os and C 0 d), the junction 
capacitances between the substrate and the source and drain regions (Cj S and Cjd), and 
the capacitances between the metal electrodes of the source, the drain, and the gate. 

The semiconductor charges of the intrinsic gate region of the MOSFET are divided 
between the mobile inversion charge and the depletion charge, as indicated in Figure 1.15. 
In addition, these charges are nonuniformly distributed along the channel when drain- 
source bias is applied. Hence, the capacitive coupling between the gate electrode and the 
semiconductor is also distributed, making the channel act as an RC transmission line. In 
practice, however, because of the short gate lengths and limited bandwidths of FETs, the 
distributed capacitance of the intrinsic device is usually very well represented in terms 
of a lumped capacitance model, with capacitive elements between the various intrinsic 
device terminals. 

An accurate modeling of the intrinsic device capacitances still requires an analysis 
of how the inversion charge and the depletion charge are distributed between source, 
drain, and substrate for different terminal bias voltages. As discussed by Ward and Dutton 
(1978), such an analysis leads to a set of charge-conserving and nonreciprocal capacitances 
between the different intrinsic terminals (nonreciprocity means C, ; / C,, , where i and j 
denote source, drain, gate, or substrate). 



Intrinsic MOSFET 




Figure 1.15 Intrinsic and parasitic capacitive elements of the MOSFET. Reproduced from 
Fjeldly T. A., Ytterdal T., and Shur M. (1998) Introduction to Device Modeling and Circuit 
Simulation, John Wiley & Sons, New York 
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In a simplified and straightforward analysis by Meyer (1971) based on the SCCM, a 
set of reciprocal capacitances (C,- 7 = C ;i - ) were obtained as derivatives of the total gate 
charge with respect to the various terminal voltages. Although charge conservation is 
not strictly enforced in this case, since the Meyer capacitances represent only a subset 
of the Ward-Dutton capacitances, the resulting errors in circuit simulations are usually 
small, except in some cases of transient analyzes of certain demanding circuits. Here, 
we first consider Meyer’s capacitance model for the long-channel case, but return with 
modifications of this model and comments on charge-conserving capacitance models in 
Section 1.5.3. 

In Meyer’s capacitance model, the distributed intrinsic MOSFET capacitance can be 
split into the following three lumped capacitances between the intrinsic terminals: 



C GS = 



BQg 

3Vgs 



C _ dQo 
v OD ,v OB ’ G ° 9F GD 



dQa_ 

v 0S ,v 0B ’ GB 9V GB 



Vgs > Vqd 



(1.56) 



where Q a is the total intrinsic gate charge. The intrinsic MOSFET equivalent circuit 
corresponding to this model is shown in Figure 1.16. 

In general, the gate charge reflects both the inversion charge and the depletion charge 
and can therefore be written as Qa — Qa + Qc,d- However, in the SCCM for the drain 
current, the depletion charge is ignored in strong inversion, except for its influence on the 
threshold voltage (see (1.18)). Likewise, in the Meyer capacitance model, the gate-source 
capacitance C G s and the gate-drain capacitance C G d can be assumed to be dominated by 
the inversion charge. Here, we include gate-substrate capacitance C GB in the subthreshold 
regime, where the depletion charge is dominant. 

The contribution of the inversion charge to the gate charge is determined by integrating 
the sheet charge density given by (1.39), over the gate area, that is, 

Qa = We; [ [VciT - V (x)] dx. (1.57) 

Jo 



Gate 




Figure 1.16 Large-signal equivalent circuit of intrinsic MOSFET based on Meyer’s capacitance 
model. Reproduced from Fjeldly T. A., Ytterdal T., and Shur M. (1998) Introduction to Device 
Modeling and Circuit Simulation , John Wiley & Sons, New York 
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From (1.41), we notice that d.r = W il ii c 1 (Vqj — V ) cl V/ / cls , which allows us to make a 
change of integration variable from x to V in (1.57). Hence, we obtain for the nonsatu- 
rated regime 



Q Gi = 



W[l n Cf 

Lids 




(Vgt — V) 2 dV — —Q 



(Vgs — Vr) 3 — (Vqd — Vr) 3 

(Vgs — Vx) 2 — (V G D — V T ) 2 



(1.58) 



where Q is the total gate oxide capacitance and where we expressed 7 ds using (1.42) and 
replaced V DS by V G s — Von everywhere. 

Using the above relationships, the following strong inversion, long-channel Meyer 
capacitances are obtained: 



C GS 




1 



/ Vqt — Vds \ 
\2Vgt — Vds / 



C G d — -C i 
C G b = 0. 



V G t V 
2 Vgt — Vds / 



(1.59) 

(1.60) 
(1.61) 



We recall that Vsat = V G t is the saturation voltage in the SCCM. The capacitances at 
saturation are found by replacing Vds = Vsat in the above expressions, that is, 

Cgss = -Q, Cgds = Cgbs = 0. (1.62) 

This result indicates that in saturation, a small change in the applied drain-source voltage 
does not contribute to the gate or the channel charge, since the channel is pinched off. 
Instead, the entire channel charge is assigned to the source terminal, giving a maximum 
value of the capacitance C G s- Normalized dependencies of the Meyer capacitances C G s 
and C G d on bias conditions are shown in Figure 1.17. 

In the subthreshold regime, the inversion charge becomes negligible compared to the 
depletion charge, and the MOSFET gate-substrate capacitance will be the same as that 
of a MOS capacitor in depletion, with a series connection of the gate oxide capacitance 
C i and the depletion capacitance C (i (see (1.19) to (1.23)). According to the discussion in 
Section 1.2, the applied gate-substrate voltage V G b can be subdivided as follows: 



Vgb — Vfb + ^s~ Cldep/Ci, (1.63) 

where Vfb is the flat-band voltage, i/'s is the potential across the semiconductor depletion 
layer (i.e., the surface potential relative to the substrate interior), and — c/dep/c'i is the 
voltage drop across the oxide. In the depletion approximation, the depletion charge per 
unit area ^ dep is related to i J/ s by ^ dep = — yc^l 11 where y — (2e s glV a )V 2 /c; is the body- 
effect parameter. Using this relationship to substitute for i// s in (1.63), we find 

0Gd = —WLq dtp = yQ (7 y 2 /4 + V G b — Vfb — Y 1 2^ , 



(1.64) 
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Figure 1.17 Normalized strong inversion Meyer capacitances according to (1.59) to (1.62) versus 
(a) drain-source bias and (b) gate-source bias. Note that Vsat = Vgt in this model. Reproduced 
from Fjeldly T. A., Ytterdal T., and Shur M. (1998) Introduction to Device Modeling and Circuit 
Simulation, John Wiley & Sons, New York 



from which we obtain the following subthreshold capacitances: 



Cgb 



Q 

V 7 ! + 4(Vqb — Vfb)// 2 



Cq s — Cgd — 0. 



(1.65) 



We note that (1.65) gives Cgb = Q at the flat-band condition, which is different from 
the flat-band capacitance of (1.33). This discrepancy arises from neglecting the effects 
of the free carriers in the subthreshold regime in the present simplified treatment. For 
the same reason, we observe the presence of discontinuities in the Meyer capacitances at 
threshold. Discontinuities in the derivatives of the Meyer capacitances occur at the onset of 
saturation as a result of additional approximations. Such discontinuities should be avoided 
in the device models since they give rise to increased simulation time and conversion 
problems in circuit simulators. These issues will be discussed further in Section 1.5. 

In the MOSFET VSM, the above-threshold capacitance expressions derived on the 
basis of the SCCM are still valid in the nonsaturated regime Vos < Vsat- The capacitance 
values at the saturation point are found by replacing V DS in (1.59) and (1.60) by Vsat 
from (1.55), yielding 



QjSs — 



1 - 



Vsat 

2VT 



C CDS = -C i 



Vsat \ 
1 - 1 - — 
2v l ; 



(1.66) 

(1-67) 



However, well into saturation, the intrinsic gate charge will change very little with 
increasing Vds- similar to what takes place in the case of saturation by pinch-off (see 
preceding text). Hence, the real capacitances have to approach the same limiting values 
in saturation as the Meyer capacitances, that is, Cqs/Q -> 2/3 and Cgd/Q — >• 0. In 
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fact, since the behavior of Cq s and Cqd in the VSM and in the SCCM coincide for 
Vds < V^sat and have the same asymptotic values in saturation, the Meyer capacitance 
model offers a reasonable approximation for the MOSFET capacitances also in short- 
channel devices. This suggests a separate “saturation” voltage for the capacitances close 
to the long-channel pinch-off voltage (~ Vgt), which is larger than Vsat associated with 
the onset of velocity saturation. 



1.4.5 Comparison of Basic MOSFET Models 

The I-V characteristics shown in Figure 1.18 were calculated using the three basic 
MOSFET models discussed above - the simple charge control model (SCCM), the Meyer 
I-V model (MM), and the velocity saturation model (VSM). The same set of MOSFET 
parameters were used in all cases. We note that all models coincide at small drain-source 
voltages. However, in saturation, SCCM always gives the highest current. This is a direct 
consequence of omitting velocity saturation and spatial variation in the depletion charge in 
SCCM, resulting in an overestimation of both carrier velocity and inversion charge. The 
characteristics for VSM and MM clearly demonstrate how inclusion of velocity saturation 
and distribution of depletion charge, respectively, affect the saturation current. 

The intrinsic capacitances according to Section 1.4.4 are shown in Figure 1.19. Meyer’s 
capacitance model can be used in conjunction with all the MOSFET models illustrated in 
Figure 1.18 (SCCM, MM and VSM). In the present device example, we note that velocity 
saturation and depletion charge may be quite important. Therefore, we emphasize that 
SCCM is usually applicable only for long-channel, low-doped devices, while MM applies 
to long-channel devices with an arbitrary doping level. VSM gives a reasonable description 
of short-channel devices, although important short-channel effects such as channel-length 
modulation and drain-induced barrier lowering (DIBL) are still unaccounted for in these 




Figure 1.18 Comparison of I-V characteristics obtained for a given set of MOSFET parameters 
using the three basic MOSFET models: simple charge control model (solid curves), Meyer's I-V 
model (dashed curves), and velocity saturation model (dotted curves). The MOSFET device parame- 
ters are L = 2p,m, W = 20p,m, d x = 300 A; /x„ = 0.06m 2 /Vs, v s = 10 5 m/s; (V a = 10 22 /m 3 , V T = 
0.43 V; Vp B = -0.75 V; e, = 3.45 x 10' n F/m; e s = 1.05 x 10- 10 F/m; m = 1.05 x 10 16 /m 3 . Re- 
produced from Fjeldly T. A., Ytterdal T.. and Shur M. (1998) Introduction to Device Modeling and 
Circuit Simulation, John Wiley & Sons, New York 
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Figure 1.19 Intrinsic MOSFET C-V characteristics for the same devices as in Figure 1.18, 
obtained from the Meyer capacitance model. The circles indicate the onset of saturation according 
to (1.66) and (1.67). Reproduced from Fjeldly T. A., Ytterdal T., and Shur M. (1998) Introduction 
to Device Modeling and Circuit Simulation , John Wiley & Sons, New York 

models. Likewise, we have ignored certain high-field effects (avalanche breakdown), and 
advanced MOSFET designs. Some of these issues will be discussed in Section 1.5 and 
in later chapters of this book. 



1.4.6 Basic Small-signal Model 

So far, we have considered large-signal MOSFET models, which are suitable for digital 
electronics and for determining the operating point in small-signal applications. The small- 
signal regime is, of course, a very important mode of operation of MOSFETs as well as 
for other active devices. Typically, the AC signal amplitudes are so small relative to the 
DC values of the operating point that a linear relationship can be assumed between an 
incoming signal and its response. Normally, if sufficiently accurate large-signal models 
are available, the AC designers will use such large-signal models also for small-signal 
applications, since this mode is readily available in circuit simulators such as SPICE. 
However, in cases when suitable large-signal models are unavailable or when simple 
hand calculations are needed, it is convenient to use a dedicated small-signal MOSFET 
model based on a linearized network. 

Figure 1.20 shows an intrinsic, common-source, small-signal model for MOSFETs. 
The model is generalized to include inputs at both the gate and the substrate terminal, 
and the response is observed at the drain (Fonstad 1994). The network elements are 
obtained as first derivatives of current- voltage and charge -voltage characteristics, result- 
ing in fixed small-signal conductances, transconductances, and capacitances for a given 
operating point. 

To build a more complete model, some of the extrinsic parasitics may be added, 
including the gate overlap capacitances and the source and drain junction capacitances, 
shown in Figure 1.15, and the source and drain series resistances. At very high frequencies, 
in the radio frequency (RF) range, the junction capacitances become very important since 
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Figure 1.20 Basic small-signal equivalent circuit of an intrinsic, common-source MOSFET. Re- 
produced from Fonstad C. G. (1994) Microelectronic Devices and Circuits, McGraw-Hill, New York 

they couple efficiently to the MOSFET substrate. Other important parasitics in this range 
are the gate resistance and the series inductances associated with the conducting paths. 
RF CMOS modeling will be discussed in more detail in Chapter 3 of this book. 



1.5 ADVANCED MOSFET MODELING 

The rapid evolution of semiconductor electronics technology is fueled by a never-ending 
demand for better performance, combined with a fierce global competition. For silicon 
CMOS technology, this evolution is often measured in generations of three years - the 
time it takes for manufactured memory capacity on a chip to be increased by a factor of 4 
and for logic circuit density to increase by a factor of between 2 and 3. Technologically, 
this long-term trend is made possible by a steady downscaling of CMOS feature size by 
about a factor of 2 per two generations. 

At present, CMOS in high volume manufacturing has progressed to the 130-nm tech- 
nology node. The technology node, used as a measure of the technology scaling, typically 
signifies the half-pitch size of the first-level interconnect in dynamic RAM (DRAM) tech- 
nology, while the smallest features, the MOSFET gate lengths, are presently at 65 nm. 
Following the evolutionary trend, the technology node is expected to decrease below 
lOOnm within a few years, as indicated in Figure 1.21. Simultaneously, the performance 
of CMOS ICs rises steeply, packing several 100 million transistors on a chip and operating 
with clock rates well into the gigahertz range. 

Very important issues in this development are the increasing levels of complexity of 
the fabrication process and the many subtle mechanisms that govern the properties of 
deep submicrometer FETs. These mechanisms, dictated by the device physics, have to be 
described and implemented into process modeling and circuit design tools, to empower 
the circuit designers with abilities to fully utilize the potential of existing and future 
technologies. 

The downscaling of FETs tends to augment important nonideal phenomena, most of 
which have to be incorporated into any viable device model for use in circuit simulation 
and device design. These include the so-called short-channel effects, which tend to weaken 
the gate control over the channel charge. Among the manifestations of short-channel phe- 
nomena are serious leakage currents associated with punch-through and threshold voltage 
shifts resulting from increasing influence of the source and drain contacts over the intrinsic 
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Figure 1.21 Projected CMOS scaling according to International Technology Roadmap for Semi- 
conductors. Reproduced from ITRS - International Technology Roadmap for Semiconductor, Semi- 
conductor Industry Assoc., Austin, TX (2001) 

channel and depletion charges. The drain-source bias induces an additional lowering of 
the injection barrier near the source, giving rise to further shifts in the threshold voltage. 
The latter also causes an increased output conductance in saturation. The loss of gate con- 
trol may be interpreted as resulting from an improper collective scaling of dimensions, 
doping levels, and voltages in the device, since an ideal scaling scheme is difficult to 
enforce in practice. 

Gate leakage is another deleterious effect that occurs in radically downscaled devices 
with gate oxide thicknesses of one to two nanometers. This leakage is the result of 
quantum-mechanical tunneling, an effect that actually poses a fundamental limitation for 
further MOSFET scaling within the next few decades. 

In addition to these “new” phenomena, well-known effects from earlier FET genera- 
tions become magnified at short gate lengths owing to enhanced electric fields associated 
with improper scaling of voltages. Examples are channel-length modulation (CLM), bias 
dependence of the field effect mobility, and phenomena related to hot electron -induced 
impact ionization near drain. 

The above mechanisms also have important consequences for the modeling of MOSFETs. 
All the presently accepted MOSFET models used by industry, including the latest BSIM 
models (Berkeley short-channel IGFET models), are, in effect, based on the GCA. As dis- 
cussed in Section 1.4, the GCA allows a separation of the model development into two 
coupled equations, one describing the local vertical field and charge distribution by means 
of a one-dimensional Poisson’s equation and another describing the lateral charge transport 
in the channel. In improperly scaled devices, this description becomes seriously flawed 
since the electrostatic problem of the gate region truly becomes a two-dimensional one, 
with lateral and vertical fields and field gradients of similar magnitudes. The consequence 
is that the GCA-based models have to be augmented by numerous empirical and semiem- 
pirical “fixes” to maintain the required accuracy. This has resulted in a plethora of device 
parameters, counting in the hundreds for the latest BSIM models. 
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1.5.1 Modeling Approach 

For any FET, the threshold gate voltage Vt is a key parameter. It separates the on- (above- 
threshold) and the off- (subthreshold) states of operation. As indicated in Figure 1.22, 
the average potential energy of the channel electrons in the p/f-state is high relative to 
those of the source, creating an effective barrier against electron transport from source to 
drain. In the on-state, this barrier is significantly lowered, promoting a high population 
of free electrons in the channel region. For long-channel devices, with gate lengths of 
several micrometers and with high power supply voltages, the behavior in the transition 
region near threshold is not important in digital applications. However, for MOSFETs 
with deep submicrometer feature size and reduced power supply voltages (such as in 
low-power operation), the transition region becomes increasingly important, and the dis- 
tinction between on- and off - states becomes blurred. Accordingly, a precise modeling 
of all regimes of device operation, including the near-threshold regime, is needed for 
short-channel devices, both for digital and high-frequency analog applications. 

In the basic MOSFET models considered in Section 1.4, the subthreshold regime is 
simply considered an p/f-state of the device, ideally blocking all drain current (although 
the SPICE implementations of some of these models include descriptions of this regime). 
In practice, however, there will always be some leakage current in the off - state owing to 
a finite amount of mobile charge in the channel and a finite rate of carrier injection from 
the source to the channel. 

This effect is enhanced in modern day downscaled MOSFETs owing to short-channel 
phenomena such as drain-induced barrier lowering. DIBL is a mechanism whereby the 
application of a drain-source bias causes a lowering of the source-channel junction barrier. 
In a long-channel device biased in the subthreshold regime, the applied drain-source 
voltage drop will be confined to the channel-drain depletion zone. The remaining part 
of the channel is essentially at a constant potential (flat energy bands), where diffusion 
is the primary mode of charge transport. However, in a short-channel device the effect 
of the applied drain-source voltage will be distributed over the length of the channel, 
giving rise to a shift of the conduction band edge near the source end of the channel, as 
illustrated in Figure 1.23. Such a shift represents an effective lowering of the injection 
barrier between the source and the channel. Since the dominant injection mechanism 
is thermionic emission, this barrier lowering translates into a significant increase of the 
injected current. This phenomenon can be described in terms of a shift in the threshold 
voltage (see, e.g., Fjeldly and Shur 1993). Well above threshold, the injection barrier is 
much reduced, and the DIBL effect eventually disappears. 



Off- state 




Figure 1.22 Schematic conduction band profile through the channel region of a short-channel 
MOSFET in the on-state and the p/f-state 
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Figure 1.23 Conduction band profile at the semiconductor-oxide interface of a short n -channel 
MOSFET with and without drain bias. The figure indicates the origin of DIBL. Reproduced from 
Fjeldly T. A., Ytterdal T., and Shur M. (1998) Introduction to Device Modeling and Circuit Simu- 
lation, John Wiley & Sons, New York 



The magnitude of the subthreshold current is obviously very important since it has 
consequences for the power supply voltages and the logic levels needed to achieve a 
satisfactory off - state in digital operations. Hence, it affects the power dissipation in logic 
circuits. Likewise, the holding time in dynamic memory circuits is affected by the level 
of subthreshold current. 

To correctly model the subthreshold operation of MOSFETs, we need a charge control 
model for this regime. Also, to avoid convergence problems when using the model in 
circuit simulators, it is preferable to use a UCCM that covers both the above- and below- 
threshold regimes with continuous expressions. One such model is a generalization of the 
UCCM that was introduced in Section 1.2.4 for the purpose of accurately describing the 
inversion charge density in MOS structures (Lee et al. 1993), 

Vgt - otV f(x) = p V* In +a[n s (x) - n 0 ]. (1.68) 



Here, Vf is the quasi-Fermi potential in the channel measured relative to the Fermi 
potential at the source and o' is a constant with a value close to unity called the bulk 
effect parameter. We note that in strong inversion, Vf (x ) can be replaced by the channel 
potential V (x) and the linear term in n s (x) will dominate on the right-hand side, signifying 
that charge transport in the channel will be drift current. Below threshold, the logarithmic 
term dominates on the right-hand side and the charge transport is primarily by diffusion. 

Although (1.68) does not have an analytical solution with respect to n s , we can use 
a generalized version of the approximate analytical expression introduced for the MOS 
capacitor in (1.38), 



n. = 2n„ In 



1 / Vgt — otV f 



<1.691 



This and related models have since been successfully applied to various FETs including 
MOSFETs, MESFETs, HFETs, poly-Si thin film transistors (TFTs), and a-Si TFTs (see 
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Fjeldly et al. 1998). The unified MOSFET model based on the UCCM expression in (1.68) 
is discussed in Chapter 8 (see also Shur et al. 1992). More elaborate MOSFET models 
such as the BSIM4 and EKV models are discussed in Chapters 6 and 7, respectively. 
They include a number of advanced features relating to small feature sizes and scaling of 
device dimensions. 

BSIM4 is presently the most advanced MOSFET model supplied with Berkeley SPICE, 
and has been adopted in most commercial simulators. Although the BSIM models are 
characterized by a large number of SPICE parameters (in the hundreds), they have gained 
a wide popularity for use in professional circuit simulation and design, and have been 
accepted as an industry standard in the United States. 

In Section 1.5.2, we consider more closely some of the advanced features included 
in modern MOSFET models, such as velocity saturation, gate bias -dependent mobility, 
impact ionization, drain and source series resistances (extrinsic modeling), channel-length 
modulation, and DIBL. In Section 1.5.3, we continue the discussion of the MOSFET 
capacitances from Section 1.4.4 and present a unified and charge-conserving description 
of the intrinsic capacitance-voltage characteristics. 



1.5.2 Nonideal Effects 



1.5. 2.1 High-field effects 

Channel -length modulation When the drain-source bias of a FET approaches the drain 
saturation voltage, a region of high electric field forms near the drain and the electron 
velocity in this region saturates (in long devices, we instead have pinch-off where n s 
becomes very small near drain). In saturation, the length AL of the high-field region 
expands in the direction of the source with increasing V DS , and the MOSFET behaves 
as if the effective channel length has been reduced by AL. This phenomenon is called 
channel-length modulation (CLM). The following simplified expression links V DS to the 
length of the saturated region (see Lee et al. 1993): 



^DS — Vp + V a 




- 1 



(1.70) 



where V p , V a , and / are parameters related to the electron saturation velocity, the field 
effect mobility, and the drain conductance in the saturation regime. In fact, V p is the 
potential at the point of saturation in the channel, which is usually approximated by the 
saturation voltage Vs at- Good agreement has been obtained between the potential profile 
described by (1.70) and that obtained from a two-dimensional simulation for the saturated 
region of an n -channel MOSFET. 

The CLM effect manifests itself as a finite output conductance in saturation, which 
tends to remain constant over a wide range of drain biases. The output conductance 
also increases steadily with increasing gate bias. This observation suggests an even sim- 
pler model than that in (1.70) for describing CLM, where the basic expression for the 
drain current is simply multiplied by the first-order term (1 +A.V DS ). In this case, the 
CLM parameter k can easily be extracted from the output conductance in the saturation 
regime, well above threshold. This first-order approximation is implemented in several 
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FET models used in circuit simulators, while expressions similar to (1.70) are used in the 
BSIM models. 

Hot-carrier effects Hot-carrier effects are among the main concerns when shrinking FET 
dimensions into the deep submicrometer regime. Reducing the channel length while retain- 
ing high power supply levels, known as constant voltage scaling, results in increased 
electric field strengths in the channel, causing acceleration and heating of the charge 
carriers. 

Some of the manifestations of hot electrons on device operation are breakdown and 
substrate current caused by impact ionization, creation of interface states, gate current 
resulting from hot-electron emission across the interface barrier, oxide charges owing to 
tunneling of charge carriers into oxide states, and photocurrents caused by electron-hole 
recombination with emission of photons (see following text). 

The substrate current resulting from electron-hole pair generation may overload sub- 
strate-bias generators, introduce snapback breakdown, cause CMOS latch-up, and generate 
a significant increase in the subthreshold drain current. A complete model for the sub- 
strate current is too complex for use in circuit level simulation. Instead, the following, 
approximate, analytical expression is widely used: 

At ( l d Bj \ 

Cubstr - /ds-r^Vcs - Vsat) exp - d , (1.71) 

t>i \ Eds — 'sat / 

where I ds is the channel current, A, and B, are the ionization constants, Vsat is the satu- 
ration voltage, and l d is the effective ionization length. This expression is also applicable 
in the subthreshold regime by using Vsat = 0 (Iniguez and Fjeldly 1997). 

In FETs fabricated on an insulating layer, such as silicon-on-insulator (SOI) MOSFETs, 
impact ionization may give rise to a charging of the transistor body, causing a shift in the 
threshold voltage. This effect results in an increased drain current in saturation (floating 
body effect). Related mechanisms are also observed in amoiphous TFTs (Wang et al. 
2000) and polysilicon TFTs (Iniguez et al. 1999). 

At sufficiently high drain bias, we have impact ionization and avalanche breakdown in 
all types of FETs. In MOSFETs, a substantial amount of the majority carriers created by 
impact ionization near drain will flow toward source and forward-bias the source-substrate 
junction, causing injection of minority carriers into the substrate. This effect can be 
modeled in terms of conduction in a parasitic bipolar transistor, as described by Sze (1981). 
In MESFETs, the breakdown usually takes place in the high-field depletion extension 
toward the drain. 

Electron trapping in the oxide and generation of interface traps caused by hot-electron 
emission induce degradation of the MOSFET channel near drain in conventional MOSFETs 
or cause changes in the parasitic drain resistances in low-doped drain (LDD) MOSFET 
(Ytterdal et al. 1995). Reduced current drive capability and transconductance degrada- 
tion are manifestations of interface traps in n -MOSFET characteristics. Reduced current 
also leads to circuit speed degradation, such that the circuits may fail to meet speed 
specifications after aging. 

Photon emission and subsequent absorption in a different location of the device may 
cause unwanted photocurrent, which, for example, may degrade the performance of mem- 
ory circuits. 
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Temperature dependence and self-heating Since electronic devices and circuits have to 
operate in different environments, including a wide range of temperatures, it is imperative 
to establish reliable models for such eventualities. Heat generated from power dissipation 
in an integrated circuit chip can be considerable, and the associated temperature rise must 
be accounted for both in device and circuit design. In conventional silicon substrates, the 
thermal conductivity is relatively high such that a well-designed chip placed on a good 
heat sink may achieve a reasonably uniform and tolerable operating temperature. However, 
such design becomes increasingly difficult as the device dimensions are scaled down and 
power dissipation increases. The thermal behavior of MOSFETs has been extensively 
studied in the past, and the temperature dependencies of major model parameters have 
been incorporated in SPICE models. 

Circuits fabricated on substrates that are poor heat conductors, such as GaAs and silicon 
dioxide, are more susceptible to a significant self-heating effect (SHE). In thin film SOI 
CMOS, the buried Si02 layer inhibits an effective heat dissipation, and the self-heating 
manifests itself as a reduced drain current and even as a negative differential conductance 
at high power inputs. Hence, for a reliable design of SOI circuits, accurate and self- 
consistent device models that account for SHE are needed for use in circuit simulation. 

The influence of SHE on the electrical characteristics of SOI MOSFETs can be eval- 
uated using a two-dimensional device simulator incorporating heat flow or by combining 
a temperature rise model with an I—V expression through an iteration procedure. But the 
effect can also be described in terms of a temperature-dependent model for the device’s 
I-V characteristics combined with the following simplified relationship between temper- 
ature rise and power dissipation: 



T — T 0 — /?(h/dVd s . 



(1.72) 



Here T is the actual temperature, T 0 is the ambient (substrate) temperature, and R th is a 
thermal resistance that contains information on thermal conductivity and geometry. The 
equations can be solved self-consistently, either numerically or analytically (see Cheng and 
Fjeldly 1996). Once the temperature dependence of the device parameters are established, 
the same procedure can also be used for describing self-heating in other types of devices, 
such as amorphous TFTs (Wang el al. 2000), GaAs MESFETs and HFETs. 



Gate bias-dependent mobility In submicron MOSFETs, scaling dictates that the gate 
dielectric must be made very thin. In a sub-0. 1-p.m device, the gate dielectric may be 
as thin as a few nanometers. With a gate-source voltage of 1 V, this corresponds to a 
transverse electric field of nearly 500kV/cm. In this case, electrons are confined to a very 
narrow region at the silicon-silicon dioxide interface, and their motion in the direction 
perpendicular to the gate oxide is quantized. This close proximity of the carriers to the 
interface enhances the scattering rate by surface nonuniformities, drastically reducing the 
field effect mobility in comparison to that of bulk silicon. 

To a first-order approximation, the following simple expression accurately describes 
the dependence of the field effect mobility on the gate bias (Park et al. 1991) 



dn — don — Kn(VoS + Vt)- 



(1.73) 
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Figure 1.24 Electron mobility versus V G s + Vt for a long-channel (L = 20 |xm) and a short- 
channel (L = 1 p,m) NMOS for different values of substrate bias. The solid line corresponds to the 
linear approximation used in (1.73). Reproduced from Park C. K. et al. (1991) A unified charge 
control model for long channel n-MOSFETs, IEEE Trans. Electron Devices , ED-38, 399-406 

The experimental MOSFET mobility data in Figure 1.24 shows that this expression 
can be applied with the same set of parameters for different values of substrate bias. The 
parameter values are fairly close even for devices with quite different gate lengths. All in 
all, this leads to a reduction in the number of parameters needed for accurate modeling 
of the MOSFET characteristics. 

More complete expression for the MOSFET field effect mobility, which takes both tem- 
perature variations and scaling into account, are used in the BSIM models (see Chapter 6). 



1.5. 2. 2 Short-channel effects 

Aspect ratio To first order, FET dimensions are scaled by preserving the device aspect 
ratio, that is, the ratio between the gate length and the active vertical dimension of the 
device. In MOSFETs, the vertical dimension accounts for the oxide thickness d,. the 
source and drain junction depths r p and the source and drain junction depletion depths 
IK, and W d . A low aspect ratio is synonymous with short-channel behavior. The following 
empirical relationship indicates the transition from long-channel to short-channel behavior 
(Brews et al. 1980): 

L < L min (p,m) = 0.4[rj(p,m)d/(A)(Wd + W s ) 2 (p,m 2 )] 1/3 (1.74) 

When L < L m the MOSFET threshold voltage Vt will be affected in several ways 
as a result of reduced gate control. First, the depletion charges near source and drain 
are under the shared control of these contacts and the gate. In a short-channel device, 
the shared charge will constitute a relatively large fraction of the total gate depletion 
charge, giving rise to an increasingly large shift in Vt with decreasing L. Also, the shared 
depletion charge near drain expands with increasing drain-source bias, resulting in an 
additional V DS -dependent shift in Vt (DIBL effect, see following text). 
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Drain-induced barrier lowering The threshold voltage is a measure of the strength of 
the barrier against carrier injection from source to channel. In the short-channel regime 
(L < L m i n ), this barrier may be significantly modified by the application of a drain bias, as 
was schematically depicted in Figure 1.23. In n -channel FETs, this drain-induced barrier 
lowering (DIBL) translates into a lowering of the threshold voltage (n -channel MOSFET) 
and a concomitant rise in the subthreshold current with increasing F DS . The combined 
scaling and DIBL effect on the threshold voltage may be expressed as follows: 



V T (L) = V To (L) - cr(L)Vvs 



(1.75) 



where Vj 0 (L) describes the scaling of Vj at zero drain bias resulting from charge sharing 
and cr(L ) is the channel-length-dependent DIBL parameter. In the long-channel case, 
where L > L min , V T should become independent of L and V DS . This behavior can be 
modeled by letting both Vj t JL) and a(L) scale approximately as exp( — L/Z. mm ). In 
BSIM, somewhat more detailed scaling functions and also a dependence on substrate bias 
are used (see Chapter 6). 

In Figure 1.25(a), we show experimental data of Vt versus Vo for two n -channel 
MOSFETs with short gate lengths. A good agreement with the linear relationship of (1.75) 
is indicated. Also, the exponential scaling for V T versus L is confirmed by experiments, 
except for a deviation at the shortest gate lengths, as shown in Figure 1.25(b) (Fjeldly 
and Shur 1993). 

As stated above, DIBL vanishes well above threshold. Lor modeling purposes, we 
therefore adopt the following empirical expression for o (Lee et al. 1993): 



1 + exp 




(1.76) 



where Vg t0 is the gate voltage overdrive at zero drain-source bias and the parameters V at 
and V a determine the voltage and the width of the DIBL fade-out, respectively. We note 
that a -> a 0 for Vg t0 < V at and a -> 0 for Vg to > V at . 

The DIBL effect can be accounted for in our I-V models by adjusting the threshold 
voltage according to (1.75) in the expressions for the saturation current and the linear 
channel conductance. Likewise, the UCCM expression in (1.68) is modified as follows: 



Vgto + er Vos ~ aV F « pV th In j + a(n s - n 0 ) (1-77) 

where V G to is the intrinsic threshold voltage overdrive at zero drain-source bias. 

A related effect of device miniaturization is observed in narrow-channel LETs, where 
charges associated with the extension of the gate depletion regions beyond the nominal 
width of the gate may become a significant fraction of the total gate depletion charge. 
In this case, a one-dimensional analysis will underestimate the total depletion charge 
and give a wrong prediction of the threshold voltage. In practice, the threshold voltage 
increases (« -MOSFET) as the channel width is reduced. A common method of modeling 
this effect is to add an additional term in the threshold voltage expression containing a 
1 IW term, where W is the effective width of the gate. 
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(b) 



Figure 1.25 DIBL effect: (a) experimental threshold voltage shift versus drain-source voltage for 
two « -MOSFETs with different gate lengths and (b) experimental threshold voltage shifts ver- 
sus gate length compared with exponential scaling. Reproduced from Fjeldly T. A. and Shur M. 
(1993) Threshold voltage modeling and the subthreshold regime of operation of short- 
channel MOSFETs, IEEE Trans. Electron Devices, TED-40, 137-145 



1.5. 2. 3 Gate leakage and effective oxide thickness 

The basic properties and the integrity of the silicon dioxide gate dielectric has been 
essential for the success of the silicon MOSFETs. However, as the CMOS technology 
node (half-pitch size in DRAMs) of MOSFETs in large-scale integration moves into the 
sub-100-nm range, this very success factor harbors one of the most difficult issues facing 
a continued evolution along the trend described by Moore’s law. The reason for this 
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lies in the nonyielding rales of device scaling combined with the well-known quantum- 
mechanical phenomenon of tunneling. In fact, to derive sufficient advantage of sub-100 nm 
technology, the gate oxide thickness has to be scaled down to just a couple of nanometers 
or less, corresponding to only a handful of atomic layers (see ITRS 2001). At such 
small dimensions, the tunneling leakage current through the oxide from the gate to the 
semiconductor becomes significant enough to add noticeably to the power consumption 
and to interfere with the device operation. 

An additional problem arises from the long-term reliability of such ultrathin dielectric 
films (Stathis 2002). These problems grow rapidly with further scaling, ultimately with 
completely debilitating consequences. The limits for viable scaling have recently been 
predicted to be at a technology node of 50 nm (gate length of 25 nm) and a silicon 
dioxide thickness of 1 nm (Wu et al. 2002). 

A temporary solution to this impasse is to replace the silicon dioxide with materials 
that have much larger dielectric constants, so-called high-k insulators. This way, the same 
scaling advantage can be derived using a correspondingly thicker insulator with reduced 
tunneling current. Many such materials are presently being investigated, but it is hard to 
find candidates that can match the chemical and electrical properties of silicon dioxide 
and its excellent interface with silicon. If this development meets with success, the end of 
the present evolutionary trend in MOSFET/CMOS technology may be extended for yet 
another decade, bringing the technology node to about 20 nm (see ITRS 2001). Within 
this time frame, alternative MOSFET architectures with improved short-channel properties 
will also have to be developed for large-scale integration, including Vertical, FinFET, and 
planar double gate structures. 

Another problem related to the thin dielectrics in MOSFETs is the relative importance 
of the inversion layer thickness in the semiconductor and the depletion layer thickness 
when using polysilicon gate electrodes. The former is a result of the lack of vertical con- 
finement of the carriers, especially electrons, owing to the quantum-mechanical uncertainty 
and exclusion principles in combination with the finite steepness of the semiconductor 
band bending at the interface. In terms of device performance, these layers add to the 
effective oxide thickness <7 e ff, thereby reducing the gate’s field effect coupling to the chan- 
nel. The two layers may contribute a few tenths of a nanometer each to <f e ff, which is quite 
significant for radically scaled MOSFETs. Development of suitable metal gate electrodes 
will alleviate some of the problem. 



1.5.3 Unified MOSFET C-V Model 



1.5.3. 1 Unified Meyer C-V model 

In order to develop unified expressions for the intrinsic MOSFET capacitances, we return 
to the Meyer capacitances discussed in Section 1 .4. The Meyer large-signal equivalent is 
shown in Figure 1.16 and the gate-source and gate-drain capacitances are given by (1.59) 
and (1.60), respectively. The only device-specific part of these equations is the gate- 
channel capacitance C c h at zero drain bias (Vp = 0), which can be derived from the 
UCCM expression of (1.69), that is, 



C ch - WLq 



d« s 

dVoT 



Ci 



Vqt \ 

nv*). 



1+2 exp 



(1.78) 
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Well above or below threshold, this expression has the familiar asymptotic forms 



C, 



ch 



c 



^ ^ Ci / V GT 

c ch ~ y exp 



(1.79) 

(1.80) 



respectively. Figure 1.26 shows C c h versus Vgt in a linear and a semilogarithmic plot. 
From UCCM and from (1.78), we find that C c h = C;/3 at threshold, which may serve as 
a convenient and straightforward way of determining the threshold voltage from experi- 
mental C c h versus Vqs curves. 

In the subthreshold regime, the gate-substrate capacitance Cqb of (1.65) is the dominant 
Meyer capacitance in MOSFETs. Above threshold, Cgb vanishes in the ideal long-channel 
case. A unified version of Cgb that includes a gradual phase-out above threshold can be 
modeled as follows: 



Cgb 



Cj/(1 + njn q) 

\/l +4(Vgs — ^bs — V m )/ Y 2 



(1.81) 



Here n s is the unified electron density given by UCCM in (1.68) or its approximate 
solution (1.69). Equation (1.81) utilizes the fact that the increasing density of inversion 
charge above threshold gradually shields the substrate from the influence of the gate 
electrode. A typical plot of Cqb versus Vqt is shown in Figure 1.27. 

Using this unified gate-channel capacitance in conjunction with Meyer’s capacitance 
model, we obtain the following continuous expressions for the intrinsic gate-source capac- 
itance Cqs and the gate-drain capacitance Cgd, valid for all regimes of operation: 



CgS = ~ C c h 



/ VGTe — VnSe \ 
\2VGTe — UxSc / 



(1.82) 




Figure 1.26 Normalized channel capacitance versus V’G'r/'/Vth according to (1.78) in a linear plot 
(right) and a semilog plot (left). The condition C c h/Ci = 1/3 at threshold is indicated. Reproduced 
from Fjeldly T. A., Ytterdal T., and Shur M. (1998) Introduction to Device Modeling and Circuit 
Simulation, John Wiley & Sons, New York 
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Figure 1.27 Normalized and unified Meyer-type gate-substrate capacitance versus VoT/'jVth 
according to (1.81) for V B s = 0. Typical values for an H-channel MOSFET with a polysilicon gate 
were used: V T = 0.7 V, V FB = — 1 V, y = 1 V 1 / 2 , and j; = 1.33. Reproduced from Fjeldly T. A., 
Ytterdal T., and Shur M. (1998) Introduction to Device Modeling and Circuit Simulation, John 
Wiley & Sons, New York 



Cgd — ^C c h 

Here, V DSe is an effective intrinsic drain-source voltage that is equal to Vds for V DS < V G Te 
and is equal to V G Te for Vds > V GTe ■ VoTe is the effective gate voltage overdrive, which 
equals Vgt above threshold and is of the order of the thermal voltage in the subthreshold 
regime. The following expression is used to model this behavior: 



V G Te 



214 



GTe 



VoSe 



(1.83) 



VGTe — Vth 



1 + ^1 + 

2 V* 




(1.84) 



where 8 determines the width of the transition region at threshold (Vq T = 0). Typically, 
8 — 3 is a good choice. 

A smooth transition between the nonsaturated and the saturated regimes is assured 
by using the following type of interpolation expression for effective intrinsic drain- 
source voltage: 




Vds + VoTe — 



y vj + (Vds — V G Te) 2 



(1.85) 



where V 5 is a constant voltage that determines the width of the transition region. This 
parameter may be treated as an adjustable parameter to be extracted from experiments. 
VoTe is needed to assure a smooth transition between the correct limiting I-V and C-V 
expressions above and below threshold. 

A comparison of the normalized dependencies of Cqs and Cgd on Vds is shown in 
Figure 1 .28 for Vs / VoTe = 0, corresponding to the nonunified Meyer capacitances, and for 
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Figure 1.28 Normalized and nonunified Meyer capacitances according to (1.59) and (1.60) 
(dashed lines) and unified Meyer capacitances according to (1.82) and (1.83) (solid lines), using a 
transition width parameter Vj = 0.2 V GTe . Reproduced from Fjeldly T. A., Ytterdal T., and Shur M. 
(1998) Introduction to Device Modeling and Circuit Simulation, John Wiley & Sons, New York 

a more realistic value of V$/Vq x e = 0.2. On the basis of the discussion in Section 1.4.4, 
we can conclude that the present unified version of the Meyer capacitances is applicable 
also for short-channel devices. Still more flexible expressions for the capacitances are 
obtained by substituting Vgt by /Vgt in (1.82) and (1.83), where y is an adjustable 
parameter close to unity. 



1.5. 3. 2 Ward-Dutton model 



As we discussed in Section 1.4, an accurate modeling of the intrinsic capacitances asso- 
ciated with the gate region of FETs requires an analysis of the charge distribution in 
the channel versus the terminal bias voltages. Normally, the problem is simplified by 
assigning the distributed charges to the various “intrinsic” terminals. Hence, the mobile 
charge Qi of a MOSFET is divided into a source charge Qs — FpQi and a drain charge 
Q n — (1 — F p )Qj, where F p is a partitioning factor. The depletion charge Qn under 
the gate is assigned to the MOSFET substrate terminal. The total gate charge Qc, is the 
negative sum of these charges, that is, Q a — — Qj — Qb — —Qs~ Q d — Qb- Note that 
by assigning the charges this way, charge conservation is always assured. 

The net current flowing into terminal X can now be written as 



d Qx _ 3 Qx dVy 

~d T ~ ^ dVy ~dT 



y, XxyCxy 

Y 



dVy 

~dT’ 



(1.86) 



where the indices X and Y run over the terminals G, S, D , and B. In this expression, 
we have introduced a set of intrinsic capacitance elements Cxy, the so-called transcapac- 
itances, defined by 



Cxy = Xxy 



3 Qx 
3 Vy 



where xxy 



|-1 for X^Y 
1 1 for X — Y 



(1.87) 
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These are equivalent to the charge-based nonreciprocal capacitances introduced by Ward 
and Dutton (1978) and by Ward (1981). The term nonreciprocal means that we have 
C X y Cyx when X ^ Y. The elements Cxx are called self-capacitances. Cxy contain 
information on how much the charge Q x assigned to terminal X changes by a small 
variation in the voltage Vy at terminal Y . To illustrate why Cxy may be different from 
Cyx, assume a MOSFET in saturation. Then the gate charge changes very little when 
the drain voltage is perturbed since the inversion charge is very little affected, making 
Cgd small. However, if Vq is perturbed, the inversion charge changes significantly and 
so does Q n , making Cdg large. 

For the four-terminal MOSFET, the Ward-Dutton description leads to a total of 16 
transcapacitances. This set of 16 elements can be organized as follows in a 4 x 4 matrix, 
a so-called indefinite admittance matrix: 

Cgg QjS Cqd Cqb 
£ _ Csg Css Csd Csb 
Cdg Cds Cdd Cdb 
Cbg Cbs Cbd Cbb 

Here, the elements in each column and each row must sum to zero owing to the constraints 
imposed by charge conservation (which is equivalent to obeying Kirchhoff’ s current law) 
and for the matrix to be reference independent, respectively (see Arora 1993). This means 
that some of the transconductances will be negative, and of the 16 MOSFET elements, 
only 9 are independent. The complete MOSFET large-signal equivalent circuit, including 
the 16 transcapacitances, is shown in Figure 1.29. This compares with the simple Meyer 
model in Figure 1.16, which comprises 3 capacitances. 



( 1 . 88 ) 




Figure 1.29 Intrinsic large-signal MOSFET equivalent circuit including a complete set of nonre- 
ciprocal and charge-conserving transcapacitances. The transcapacitances Cxy are defined in the text. 
Reproduced from Fjeldly T. A., Ytterdal T., and Shur M. (1998) Introduction to Device Modeling 
and Circuit Simulation , John Wiley & Sons, New York 









42 MOSFET DEVICE PHYSICS AND OPERATION 



We note that in three-terminal FETs, such as HFETs, MESFETs, and TFTs, we have a 
total of 9 transcapacitances, of which 4 are independent (Nawaz and Fjeldly 1997). The 
equivalent circuit for this case is obtained from Figure 1.29 by removing the substrate 
terminal B and all elements connected to it. Also, the 4 x 4 matrix in (1.88) reduces to 
a 3 x 3 matrix. 

As explained in Section 1.4.4, the simplified C-V model by Meyer is obtained by 
taking derivatives of the total gate charge with respect to the various terminal voltages. 
The Meyer capacitances can be viewed as a subset of the Ward-Dutton capacitances. 
Although charge conservation is not assured in the Meyer model, the resulting errors in 
circuit simulations are usually small, but can in some cases lead to serious errors. The 
unified transcapacitances needed for the complete Ward-Dutton model can be obtained 
along the same lines as described for Cos and Cos ■ The accuracy of the model depends on 
the quality of the charge and current models used and on the partitioning of the inversion 
charge between the source and the drain terminal. 



1.5. 3. 3 Non-quasi-static modeling 

For very high-frequency operation of the MOSFET, comparable to the inverse carrier 
transport time of the channel (non-quasi-static (NQS) regime), we have to consider the 
temporal relaxation of the inversion and depletion charges. Most of the MOSFET models 
used in SPICE are based on the quasi-static assumption (QSA), in which an instantaneous 
charging of the inversion layer is assumed. Hence, circuit simulations will fail to accurately 
predict the performance of high-speed circuits. 

The channel of a MOSFET is analogous to a bias-dependent distributed RC network as 
indicated schematically in Figure 1.30. In QSA, the distributed gate-channel capacitance 
is instead lumped into discrete capacitances between the gate and source and drain nodes, 
ignoring the finite charging time arising from the RC product associated with the channel 
resistance and the gate-channel capacitance. 

The inclusion of the so-called Elmore equivalent circuit shown in Figure 1.31 can be 
viewed as a first step toward an NQS model. Using this equivalent circuit, the channel 
charge buildup is modeled with reasonable accuracy because the lowest frequency pole 
of the original RC network is retained. The Elmore resistance /rfi morc is calculated from 
the channel resistance in strong inversion as 



R 



Elmore 



L 



2 

eff 



^Meff £2ch 



(1.89) 



Gate 




Substrate 



Figure 1.30 Equivalent RC network representing the MOSFET channel 
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Gate 




Figure 1.31 Elmore non-quasi-static equivalent circuit 



where e is the Elmore constant with a theoretical value close to 5 and Q c h is the total 
charge in the channel. This formulation is only valid above threshold where the drift 
current dominates. 



To obtain a unified expression, including the subthreshold diffusion current, a relaxation 
time-based approach is adapted. The overall relaxation time for channel charging and 
discharging can be written as a combination of the contributions due to drift and diffusion 



as follows: 

1 _ i 1 

r Idrift Cliff 



(1.90) 



where 



r drift 



?(£eff/4) 

Tuff — j „ • 

MefmB T 



(1.91) 

(1.92) 



On the basis of this relaxation time concept, the NQS effect can be implemented in 
the SPICE MOSFET model using the subcircuit shown in Figure 1.32. The variable Qdef 
is an additional node created to keep track of the amount of deficit or surplus channel 
charge needed to achieve equilibrium. Q ( i e) will decay exponentially into the channel with 
a bias-dependent NQS relaxation time r, and the terminal currents can be written as 





Figure 1.32 Non-quasi-static subcircuit implementation in MOSFET SPICE models. The RC time 
constant r is determined by the resistance and capacitance values chosen 
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Q def 
T 



(1.94) 



Here X d = 1 — F p and X s = F p , where F p is the charge partitioning factor introduced in 
Section 1.5. 3. 2. 
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2 

MOSFET Fabrication 



2.1 INTRODUCTION 



Semiconductor devices have long been used in electronics since the late nineteenth cen- 
tury. The galena crystal detector, invented in 1907, was widely used to build crystal radio 
sets. However, the idea of placing multiple electronic devices on the same substrate mate- 
rial came only after the late 1950s. In 1959, the first integrated circuit (IC) was constructed, 
which started a new era of modern semiconductor manufacturing. In less than 50 years, the 
IC technology, represented primarily by the complementary-metal-oxide-semiconductor 
(CMOS) process, has gone through the periods from producing very simple chips contain- 
ing a few bipolar or MOS components to fabricating ultra-large-scale-integrated (ULSI) 
CMOS circuits with very high device densities from millions of transistors a chip for 
some circuits such as microprocessors to more than several billions of transistors a chip 
for some circuits such as memories. As predicted by Moore’s law created in the early 
1970s, the number of transistors per chip for a microprocessor has continued to dou- 
ble approximately every 18 to 24 months. Taking Intel’s processors as an example, the 
number of transistors on a chip has increased more than 3200 times, from 2300 on the 
4004 microprocessor in 1971 to 7.5 million on the Pentium II processor in 1996, and to 
55 million on the Pentium 4 processor in 2001. At the same time, the minimum dimension 
of the transistors has reduced from about 20 |xm in 1960 to 0.35 |xm in 1996, and more 
rapidly recently, to 0.13 |xm in 2001, resulting in an amazing improvement in both speed 
and cost of the circuits. 

The development of IC technology was driven mainly by the digital circuit (micropro- 
cessor and memory) market. Recently, however, CMOS technology has been extensively 
used in the analog circuit design because of the low cost of fabrication and compatibility 
of integrating both analog and digital circuits on the same chip, which improves the over- 
all performance and reliability and may also reduce the cost of packaging. It has been 
the dominant technology to fabricate digital ICs and will be the mainstream technology 
for analog and mixed-signal applications. Currently, circuit designers are even exploring 
emerging pure CMOS approaches - integrating digital blocks, analog and radio-frequency 
(RF) circuits on a single chip to implement the so-called mixed-signal (MS) or system- 
on-chip (SOC) solutions. 
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In today’s IC industry, much of the design efforts, including layout generators, device 
models, and technology files, have been automated in the design tools provided by either 
foundries or design automation vendors. However, a basic understanding of semiconductor 
devices and fabrication processes is essential to optimize the circuits, especially analog/RF 
circuits. This chapter provides a brief overview of the CMOS process. We first discuss the 
major process steps in CMOS fabrication. Then we will go though a typical digital process 
flow to understand the MOSFET structures and the concepts of MOSFET fabrication. 
Finally, additional fabrication steps for other components, mainly passive devices, in an 
analog/RF process will be discussed. 



2.2 TYPICAL PLANAR DIGITAL CMOS 
PROCESS FLOW 

The polysilicon gate CMOS process has been widely used for IC fabrication. A MOSFET 
process flow in a baseline polysilicon gate CMOS fabrication is described in the flowchart 
in Figure 2.1. The mask operations are illustrated in the figure. 

Starting material 

In this book, we only discuss CMOS technology that is fabricated from silicon, a very 
common and widely distributed element on earth. The mineral quartz consists entirely 
of silicon dioxide, also called silica. Ordinary sand is composed mainly of tiny grains 
of quartz and is therefore also mostly silica. Despite the abundance of its compounds, 
elemental silicon does not exist naturally. The element can be artificially produced by 
heating silica and carbon in an electric furnace. The carbon unites with the oxygen con- 
tained in the silica, leaving molten silicon. As it is solidified, it will be in a polycrystalline 
structure, that is, there is no regular crystal structure throughout the block of the material 
but simply small areas of crystals at different orientations to neighboring crystal areas. 
Impurities and disordering of the metallurgical-grade polysilicon make it unsuitable for 
semiconductor manufacture as a substrate material. The polysilicon can be refined in a 
purification process to produce an extremely pure semiconductor grade material. Once the 
material has been purified, it can be further processed into single crystal bars by using the 
so-called Czochralaki method, in which the purified material is completely molten and the 
seed crystal is dipped into the surface of the melt and slowly withdrawn and rotated. The 
speed of pull and the rate of cooling will determine the diameter of the final rod of the 
material. Dopant material can be introduced into the melt in the required ratio. Since ICs 
are formed upon the surface of a silicon crystal within a limited depth (< 10 |un ), the crys- 
tal bar is customarily sliced into numerous thin circular pieces called wafers. The larger the 
wafer, the more ICs it can have, and so the lower the fabrication cost. Most modern pro- 
cesses currently employ 200-mm (8") wafers. However, process lines for 300-mm (12") 
wafers have been announced to operate in 2002 by the semiconductor foundry industry. 

Although a certain amount of dopant material can be introduced into the material during 
the crystallization process, there is a limit on the doping level that can be introduced if 
a consistent dopant concentration is to be maintained throughout the material. To obtain 
the highly doped regions required by some of the active devices, a further crystal growth 
process called epitaxy is performed to provide a thin epitaxial region on the top of the 
“native” wafer. A continuous crystal structure has to be maintained, so the resulting 
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wafer is still a single crystal throughout. Epitaxy allows the formation of buried layers. 
The formation of an n + buried layer is one of the key steps in most bipolar and BICMOS 
processes. CMOS ICs are normally fabricated on a p-typc (100) substrate doped with 
boron. To provide a better immunity against CMOS latch-up, the substrate is usually 
doped as high as possible, limited by solid solubility, to minimize the substrate resistivity. 
In principle, this kind of p-type wafer can be used directly for fabrication. However, a 
lightly doped p-typc epitaxial layer is usually formed to maintain the latch-up immunity 
but have more precise control of the electrical properties of the substrate material and 
hence control of the MOSFET electrical characteristics. 




Nitride 
Pad oxide 

Silicon 



(1) Substrate wafer after the pad oxide growth and nitride deposition 



Nitride 
Pad oxide 

Silicon 



(2) Pad oxide and nitride removing for STI etch 





Nitride 
Pad oxide 

Silicon 



Figure 2.1 Illustration of the process flow for MOSFETs in a baseline CMOS technology 





TYPICAL PLANAR DIGITAL CMOS PROCESS FLOW 



51 




(9) p-channel V t adjusting and punch-through implantation 




(10) Gate oxide growth and poly deposition 




(1 1) Poly-gate formation for n/pFET and LDD implantation for 
both uFET and pFET 



Figure 2.1 ( continued ) 
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(17) Metal 1 layer deposition 




(18) Metal 1 etch 




(19) Interlayer isolation dielectric deposition 




(20) Via etch 



Figure 2.1 ( continued ) 
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72 -/ p-well formation 

The production of silicon wafers is only the first step in the fabrication of ICs. The con- 
struction of devices and circuits on the surface of the wafers in the planar process relies 
on processing to selectively deposit or remove the materials in some regions. Layout pat- 
terns, describing where areas of further doping, insulating layers, metal interconnections, 
and so on are located, need to be transferred onto the wafer. The technique of transferring 
the layer patterns onto the wafer is based on having layers of resistive material covering 
the surface, areas that can be selectively removed to expose the circuit below to doping, 
etching, or deposition of further layers. The process has many similarities to photographic 
techniques and is termed photolithography. 

After the wafer has been thermally oxidized, a layer of photoresist that has been spun 
on to it is patterned using the /?-well mask. Oxide-etch opens windows through which 
ion implantation deposits a controlled dose of phosphorus. A prolonged high-temperature 
drive creates a deep lightly doped 72 -type region called an n -well. The 77 -well for a typical 
20-V CMOS process has a junction depth of about 5 p,m. Thermal oxidation during the 
well drive covers the exposed silicon with a thin layer of pad oxide. 

In an 77-well CMOS process, 77MOS transistors are formed in the epitaxial layer, and 
pMOS transistors reside in the well. The increased total dopant concentration caused by 
counterdoping the well slightly degrades the mobility of majority carriers within it. The 
77 -well process therefore optimizes the performance of the 77MOS transistor at the expense 
of the pMOS transistor. As a side effect, the 77 -well process also produces the grounded 
substrate favored by circuit designers. 

A p-well CMOS process also exists. It uses an n + substrate, an 77-epitaxial layer, and 
a p-well. 77MOS transistors are formed in the p-well, and pMOS transistors are formed 
in the epitaxial layer/substrate. The p-well process optimizes the pMOS transistor at the 
expense of the 72MOS transistors. Since the 77-well process provides better 77MOS tran- 
sistors, which have better performance than pMOS transistors due to the higher carrier 
mobility, and the p-well process requires that the substrate is connected to the highest 
voltage supply instead of ground, which increases the design complexity in biasing the 
circuits, the 77-well process is used more widely than the p-well process. Another advan- 
tage of the 77 -well process is that it is upwardly compatible with BICMOS technology, 
which has been used for high-frequency (HF) and high-speed applications. 

In today’s more advanced CMOS technologies (0.18 p,m and below), both 77 -well and 
p-well has been used to optimize device characteristics of both 72 FET and pFET 1 . In 
a twin-well process, the p-well formation does not require additional mask and uses a 
reverse mask for the 72 -well 2 . It has been proved that a twin- well process provides more 
benefits than a single-well process even though the complexity of the process has been 
increased. Actually, recently, a triple-well process has been reported for RF applications, 
which will be discussed later. In Figure 2.1, we use the twin-well process to illustrate the 
fabrication of CMOS technology. 

Field region definition and channel stop implantation 

Regions with thick oxide are defined outside the active regions to increase the threshold 
voltages in these regions and reduce parasitic capacitance between interconnects and the 

1 As we will discuss later, triple-well process exists also for RFCMOS process. 

2 In a dual-well process with device options such as native MOSFET, the p-well is a reverse mask of both 
n-well and optional devices. 
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underlying silicon. Local oxidation of silicon (LOCOS) or shallow trench isolation (STI) 
technique selectively grows thick oxide in the so-called field regions while leaving a thin 
pad oxide only over active regions. Taking the LOCOS process as an example, a patterned 
nitride layer is formed by first depositing nitride across the entire wafer, then defining 
the field region by using a specific mask, and finally removing the nitride over the field 
region selectively by using an etch step. The photomask used for this step is called an 
inverse moat (positive) mask because it consists of a color reverse of the moat regions, 
that is, the mask codes for areas where the moat is absent, not where it is present. The 
pad oxide between the silicon and the nitride layer in the field regions are critical because 
the conditions of nitride growth introduce mechanical stresses that can cause dislocations 
in the silicon lattice and the pad oxide provides mechanical compliance preventing strain 
produced by the nitride growth from damaging the underlying silicon. 

Selective channel stop implantation underneath the field oxide usually requires to ensure 
that the threshold voltage in the field regions exceeds the operating voltages, p-substrate 
field regions receive a /My pc channel stop implant, while «-well field regions receive an 
n-type channel stop implant. After the channel stop implantation, all photoresist will be 
stripped from the wafer in preparation for isolation processing. 

Isolation processing 

LOCOS used to be the most popular isolation technique for CMOS fabrication (Kooi 
1991). Recently, however, STI is widely accepted as an effective approach to overcome 
the bird’s beak encroachment (see, for example, Nandakumar et al. 1998) and the low 
isolation punch-through voltage of LOCOS-based isolation for subquarter micron devices 
(Tsai et al. 1989). CMOS devices and fabrication processes based on LOCOS have been 
very well discussed in many books. In the following, we show the processing and device 
structure with STL 

In spite of its superior isolation characteristics and extensibility, the STI process is 
much more complicated than LOCOS. In general, trench-etch, chemical vapor deposition 
(CVD) oxide filling, and planarization using a chemical mechanical polishing (CMP) are 
considered as key technologies for STL STI requires a silicon-etch, so etch-damaged layers 
like crystallographic displacement and/or contamination of unwanted substances exist to 
a certain extent. Improper post-etching treatment can cause trap-assisted leakage current 
due to the damaged layers. To improve this, sidewall oxidation is used for crystallographic 
recovery of the silicon substrate. The roles of the sidewall oxidation include removal of 
plasma damage during trench-etch, active edge rounding for the suppression of parasitic 
channels, and reduction of interface traps for the decrease of junction leakage. Analogous 
to the gate oxidation process, in which a sacrificial oxidation step is adopted to remove 
damaged silicon formed during previous processes and, as a result, to improve the gate 
oxide quality, sidewall oxidation also requires an additional sacrificial oxidation step. 

Threshold adjust and gate oxidation 

Because of the existence of many different doping steps in the fabrication, the threshold 
voltage of a pFET without threshold adjust can be as high as —2 V, which deviates from 
the range (— 0.7 V~ —0.4 V) in modern MOS transistors. So threshold adjust implantation 
is needed to move the threshold voltage to the desired targets. 

Typically, two separate implants are used to adjust the threshold voltages (Vi) in nFETs 
and pFETs independently. After the wafer has been covered by photoresist, the Vj adjust 
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mask is used to open windows over areas where MOSFETs will form. The boron adjust 
implant penetrates the dummy gate oxide, formed to eliminate the gate oxide integrity 
failures caused by nitride deposition, to dope the channel region underlying the dummy 
oxide. After the V, adjust implantation, the dummy gate oxide is stripped away to reveal 
bare silicon in the active regions. 

The real gate oxide will be formed by using oxygen at high temperature to ensure the 
quality of the Si-SiCT interface by minimizing the charges due to surface states and other 
traps. This oxidation process must be well controlled to obtain a very thin gate oxide, 
which is around 2.5 nm for advanced technology. The gate oxide will be the dielectric 
of the MOSFETs; it also covers the source/drain regions when implantations occur in 
these regions. 

Polysilicon deposition and gate definition 

Currently, CMOS technology uses polysilicon as the material for the gate electrodes. 
It is heavily doped to reduce its resistivity. The typical sheet resistance for a polysili- 
con gate ranges between 20 and 40 O/sq and can be reduced by a factor of 10 with a 
silicide process, and even more with a metal stack process. Although polysilicon gates 
do not conduct significant DC, switching signals at the gates do produce substantial 
AC, and low resistance polysilicon will improve the switching speed of the circuits. RF 
applications also prefer low gate resistance to reduce HF noise in the circuits. Further- 
more, high doping concentration in the poly gate can help reduce/eliminate the so-called 
poly-depletion effect, which will influence both the DC and the AC characteristics of 
the device. 

The deposited polysilicon layer must be patterned using the poly mask according to the 
designed geometry for the gates. Since the gate dimension is the most important parameter 
for a MOSFET, the definition and etching of poly gates are considered the most critical 
photolithographic steps in CMOS fabrication, especially since the rapid advance in VLSI 
manufacturing has brought the minimum device feature size and the spacing between 
devices below the wavelength of the light source. To accurately define the poly gates 
with critical feature dimensions, advanced compensation mechanisms are required that 
perturb either the shape via optical proximity correction (OPC) or the phase via phase- 
shifting masks (PSM) of the transmitting apertures in the reticle when preparing the masks 
used for fabrication. 

Early CMOS technology used to adopt single poly gate (n + poly typically) for both 
nFETs and pFETs. In recent more advanced technologies, dual-gate (that is, different 
doped poly gates for nFETs and pFETs) technology has been widely used to improve the 
electrical performances in both nFETs and pFETs. In a dual-gate processing, a heavily 
doped n + poly layer is used as the gate electrodes for nFETs, while a heavily doped p + 
poly layer is the gate electrodes for pFETs. The introduction of the p + polysilicon gate 
in pFETs is to achieve surface channel operation, which offers the advantages of lower 
threshold voltage, superior short-channel effects, and subthreshold leakage compared to 
buried-channel pFETs. Also, shallower junction depth can be obtained as boron species 
are implanted in the self-aligned p + polysilicon gates. 

Source/drain implantation 

Source/drain implantation is another critical processing step determining the electrical 
performance of the device. n + source/drain implantation (NSDI) is needed for nFETs 
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and p + source/drain implantation (PSDI) is needed for pFETs, followed by an annealing 
process to activate the dopants implanted in the source/drain regions. The polysilicon gates 
completed in the previous process steps are used as masks to self-align the source/drain 
implants for both «FETs and pFETs. Before NSDI starts, photoresist is applied to the 
wafer, followed by patterning using the NSDI mask. Shallow and heavily doped n + 
regions are then formed by implanting arsenic in the /?FET S/D regions through the 
exposed gate oxide. The polysilicon gates block the S/D implants from the channel region 
underneath the gate and minimize the overlap capacitances between the gate and the source 
and between the gate and the drain. Once the NSDI is completed, the remaining photoresist 
is removed from the wafer. The PSDI begins with covering a photoresist layer patterned 
using the PSDI mask. Self-aligned shallow and heavily doped p + regions are formed by 
implanting boron in pFET S/D regions through the exposed gate oxide while keeping 
minimal overlap capacitances. After PSDI is completed, the photoresist is stripped from 
the wafer before the annealing process begins. The annealing process will activate the 
implanted dopants and slightly increase the oxide thickness over the source/drain regions. 
For CMOS technology, this annealing process is the final high-temperature step in the 
whole process. The junction depth is determined by the conditions of both the source/drain 
implantation and the annealing. 

Contacts and metallization 

When the source/drain region implantation is completed, a thick oxide layer is deposited 
as an insulation material between the active devices and the metal interconnects, so the 
metal interconnects can run over the field regions and the poly gates without influencing 
the device characteristics. Contact cuts are needed to open the thick oxide and form a good 
contact between the metal interconnect and the source/drain regions and the poly gates. 

After the wafer is again coated with photoresist, the contacts in source/drain regions 
and in polysilicon gates are patterned using the contact mask. As the device sizes shrink, 
the contact sizes also shrink, which makes the formation of the contacts important to 
ensure the yield in the manufacturing. 

Modern CMOS processes employ a metal silicidation technology to obtain good ohmic 
contacts in the gates and the source/drain regions, improving reliability by blocking any 
junction spiking. Metals with low sheet resistivity and contact resistance and high gate 
insulator reliability and heat stability are selected for use in silicided metallization. Since 
silicided polysilicon can reduce the gate resistance, it is suitable for digital IC applications. 
However, when dual-gate processing is implemented in CMOS technology, impurity inter- 
diffusion in the gate electrodes becomes an issue. Specific processing such as nitrogen 
ion implantation can be used to suppress this dopant interdiffusion in the gate electrodes. 
But this process requires a trade-off between dopant interdiffusion and gate depletion. 

Heat resistance is another issue to be considered for salicide technology to be used 
for analog applications. Salicide technology is fully suitable for the conventional CMOS 
digital/ ASIC ICs; however, metallization technology with high heat tolerance is needed 
because some process steps after salicide metallization need the high-temperature process 
to fabricate passive devices such as metal-insulator-metal (MIM) capacitors. Lower tem- 
perature process for MIM capacitors has been developed while advanced metal-stacked 
poly-gate structures are being developed. The metal-stacked gate structure can provide 
both low resistivity and high heat tolerance and is a potentially promising technology for 
analog applications. 




60 MOSFET FABRICATION 



Via and multilayer metal interconnects 

After finishing the contact and the metallization, the basic transistors have been fabricated. 
In the next steps, the wafers need to complete the so-called back-end processing. Basically, 
it consists of multilayer metal interconnect formation, via etching and interlayer dielectric 
(ILD) deposition. The purpose of the multilayer interconnect with ILD is to isolate the 
metal layers from each other and to provide the various electrical connections needed on 
a chip by metal wires and via contacts. In the current 0.13-|xm CMOS technology, the 
interconnects can extend up to eight metal layers. For digital processing, the back-end 
portion has become extremely critical in terms of the yield, reliability, and cost reduction. 

Following the opening of the contact windows, metal 1, the first layer of metal inter- 
connect, is deposited. It could be either aluminium or copper, depending on the technology 
node generation. Typically, for the 0.15-p.m technology and older, aluminium is used for 
all interconnect layers (even though some foundries offer copper for the top two metal 
layers) and for a 0.13-pm technology and newer, copper is used for all interconnect 
layers. After the metal deposition, a lithography step will be processed with a mask to 
define all the needed metal connections, so the metal outside these defined areas will be 
selectively etched. 

Other metal layers for multilevel interconnects are fabricated following the same pro- 
cedures. Between adjacent metal layers, a dielectric material is deposited for isolation. 
To provide the signal paths between the metal layers, depending on the design of the 
circuit, contact holes are created by opening windows throughout the dielectric layers. 
The contact holes between the metal layers are called “vias” to distinguish them from the 
“contact” for the first level of metal to the active areas and to polysilicon. 

In current advanced technologies such as 0.15 |xm and newer, copper has been used for 
interconnect. Compared with aluminium, copper is more difficult to fabricate to achieve 
high-quality interconnects with reliable performance. New processing techniques such as 
metal slotting and via pattern as well as the metal dummy filling at each metal layer have 
been applied. A detailed discussion on advanced interconnect technology will be outside 
the scope of this book. 



2.3 RF CMOS TECHNOLOGY 

The logic CMOS technology discussed earlier includes various flavors of MOSFETs 
and some resistors such as polysilicon resistors, diffusion resistors, and «-well resistors. 
However, an RF CMOS technology should incorporate passive device options for high- 
frequency (HF) applications such as resistors, capacitors, and inductors in addition to 
active devices such as MOSFETs and bipolar junction transistors (BJTs). Also, the varactor 
is an important component in RF technology. So far, RF CMOS technology is developed 
on the basis of the available digital planar CMOS processing by adding necessary process 
steps and device structures for RF applications. For example, process steps to form deep n- 
well are added to reduce the substrate coupling and noise figures of MOSFETs. Additional 
implantation is adopted in a varactor to increase the tuning range and the quality factor. A 
much thicker top metal layer is introduced to increase the Q factor of the inductor devices, 
and so on. Even though most RF CMOS technologies are currently developed on the 
basis of digital CMOS process compatibility and cost saving, sufficient reason exists for 
developing a separate high-quality CMOS process for RF applications. The first problem 
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is the lack of commonality between digital and analog device targets when optimizing 
the device performance in the process development. For digital CMOS, device parameters 
such as saturation current /<i saI , leakage current 7 0 ff, threshold voltage, and gate leakage are 
the most important ones to be optimized. But for analog applications, transconductance 
G,„, output conductance Gds, and device-matching behavior are more important. For 
example, 7 0 ff (of great priority to the digital designer) is not highly prioritized by the analog 
designer. The second problem is that the desired analog process optimization strategies 
may be contradictory to traditional CMOS scaling. A clear example here is 7 0n /7 0 ff versus 
G,„/Gds- Traditional CMOS scaling methodologies incorporate halo (pocket) implants to 
control short-channel effects. Flowever, halos have a detrimental effect on G„, /G'ds owing 
to the drain bias-induced modulation of the barrier created by the halo on the drain side 
of the device. Solutions such as using lateral work-function grading and asymmetric halos 
have been proposed to fix this problem. Flowever, each of these approaches adds to the cost 
and complexity of the process and pushes the devices away from the baseline technology. 
Thus, a “special” RF CMOS technology with optimized device performance specifically 
for RF/analog applications could coexist with “regular” RF CMOS based on the baseline 
digital process (Woerlee et al. 2001). Designers could select different RF CMOS processes 
depending on their circuit applications. In some cases, an RF CMOS technology based 
on the baseline digital process could be used to meet the design specifications while the 
cost of a “special” RF process is much higher. 

(1) Deep n-well options in an RF CMOS technology 

The schematic cross section of an n-typc MOSFET with the proposed deep n-wcll struc- 
ture and its key process flow are shown in Figure 2.2 (Su et al. 2001). The process is 
based on a regular logic CMOS technology, with the addition of a deep n-well implan- 
tation and mask. To minimize the disturbance to the DC behavior of MOS transistors, 
high-energy ion implantation (I/I) steps, followed by postimplant annealing, were used to 
form the deep «-well. Specifically, I/I steps consist of a 2-MeV I/I with an arsenic dose 
of 2 x 10 13 cm 2 to form a deep enough n-well so as not to disturb the doping profile 
of the inner p-well and a 1-MeV I/I with a lower arsenic dose was used together with 
the regular n-wcll implant and a flat plateau of deep «-well. A postimplant annealing is 




Figure 2.2 RF /iFET with deep n-well option. Reproduced from Su J.-G. et al. (2001) Improving 
the RF performance of 0.18 p,m CMOS with deep n-well implantation. IEEE Electron Device Lett., 
22(10), 481-483 







62 MOSFET FABRICATION 



required. As a result, the resultant //-well (i.e., regular //-well and deep //-well combined) 
completely surrounds the //-well region for junction isolation. 

(2) Varactor fabrication 

Varactors (variable capacitors) are very important components in RF circuits such as 
voltage-controlled oscillators (VCOs). Historically, varactors in integrated technology 
were constructed by diodes. In a dual-well CMOS process, there exist at least four types 
of diodes: n + /p-mb, n + / p- well, n-well/p-sub, and p + /n- well. However, p + /n - well 
diodes are used more than others because they offer higher tuning range than //-well/ p- 
sub junctions and provide better immunity against latch-up than n + /p - sub and n + /p - well 
junctions. In most RF CMOS technologies, both “free” varactors and high-quality var- 
actors are offered. “Free” varactors use the p + /n - well junction in the same way as that 
in MOSFETs and do not introduce any additional mask, but the tuning range and qual- 
ity factor is lower than that in high-quality varactors which require one additional mask 
to adjust the implant to achieve an optimized doping profile for the //-///-junction by 
considering together several figures of merit for a varactor such as tuning range, quality 
factor, breakdown voltage, leakage current, and so on. Figure 2.3 gives a cross section of 
a p + /n - well diode varactor. 

In addition to the diode varactors discussed above, other types of varactors have also 
been fabricated, such as MOS varactors. As in the case of diode varactors, both “free” and 
high-quality MOS varactors have been used in design. “Free” varactors are constructed by 
using MOS capacitors formed by an n + poly gate over an //-well. The designers can select 
different channel lengths and doping concentrations (different threshold voltage options 
offered in a specific process) to have a design trade-off between the tuning range and the 
quality factor. High-quality MOS varactors have been reported by modifying the “free” 
MOS varactors with added STI isolation between the channel (underneath the gate) and 
the /7-well contact regions. With increased mask and process costs, the tuning range of 
this type of varactors can be increased without reducing the quality factor a lot. Figure 2.4 
gives a cross section of a MOS varactor. 

Design and fabrication of varactors with high tuning range and quality factor are needed 
to be considered in developing an RF CMOS process. 

(3) MIM capacitor fabrication 

Analog/mixed-signal processes use four major types of capacitors: polysilicon-insulator- 
polysilicon (PIP) capacitors, vertical metal-insulator-metal (VMIM) capacitors. Flux MIM 




Figure 2.3 Cross section of a p + /n - well diode varactor 
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Figure 2.4 Cross section of a MOS varactor 



(FMIM) capacitors, and MOS-style (depletion or accumulation) capacitors. Many older 
technologies used PIP capacitors, which are not suitable for RF applications in the giga- 
hertz range because of both the resistive losses in the plates and the contacts and because 
of the parasitic capacitance between the passive element and the lossy silicon substrate. 
Also, the poly in PIP capacitors is typically implanted at higher doses than the CMOS 
source-drain regions in order to minimize poly-depletion effects. This requires extra pro- 
cessing costs because of additional lithography layers that need to be added to support 
the implants. So far the most popular analog/mixed-signal capacitor is the VMIM (see 
Figure 2.5). VMIM capacitors have the inherent advantage that they are metal without any 
poly-depletion and poly-gate loss issues and, if implemented at the last metal layer, have 
the entire ILD stack between them and the substrate, so the parasitic capacitance is much 
smaller. The VMIM capacitors were widely used in the 0.18-|im and older aluminum- 
interconnect-based processes. In recent years, they have been implemented in commercial 
CMOS Cu-damascene processes. The excellent linearity with voltage and temperature 
illustrates the popularity of the device as an analog element. However, the issues about 
yield and reliability need still to be resolved for copper-interconnect-based processes. 
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Figure 2.5 Cross section of a VMIM capacitor 
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One of the restrictions with MIM devices is that process technologies do not scale 
the vertical spacing in the back end nearly as fast as the lateral spacing. The reason 
is that digital circuit designs cannot tolerate large increases in the wiring capacitance 
from generation to generation. Lateral flux (finger) MIM capacitors solve this problem by 
using the lateral capacitance between the metal lines rather than the vertical capacitance 
between the different ILD layers. As a result, the capacitance is under design control 
and scales more effectively with the technology. Also, the FMIM capacitors do not need 
any extra mask to define the bottom and top plates, so the cost is lower. However, the 
matching property of the lateral FMIM is about one order (the actual numbers depend 
on the technology node) worse than that of a VMIM capacitor, so such FMIM capacitors 
may not be suitable for some analog applications that require precise matching behavior 
to the MIM capacitors. Figure 2.6 gives the cross section of an FMIM capacitor for a 
four-metal layer process. 

Another of the limitations of the MIM device is the small capacitance per unit area due 
to the thickness of the insulator dielectric. Many designers take advantage of thin gate 
oxide processes to achieve high capacitance per unit area by using MOS capacitors. The 
disadvantage of using MOS capacitors is the high series resistance of a MOS capacitor due 
to the bottom plate that is formed with the doped channel/substrate. Also, the high gate 
leakage currents in modern devices with scaled oxides make MOS capacitors excessively 
leaky, which should be considered when using them for some leakage-sensitive designs 
such as for wireless applications. 

(4) Resistor fabrication 

Precision resistors are key passive elements in both digital and analog circuits (Ulrich 
et al. 2000). Different types of resistors exist in a CMOS process, such as n and p 
polysilicon resistors, n and p diffusion resistors, n-well resistors, and metal thin film 



Plate A 



Plate B 











Dielectric 








/i-well 







p-substrate 

Figure 2.6 Cross section of a FMIM capacitor 
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resistors. Depending on the doping and the salicidation control, resistors of different 
resistance values can be fabricated. The simultaneous presence of both poly and metal 
resistors has added value in a CMOS process, because the metal resistors are at the top of 
the stack and the poly resistors at the bottom. The two widely separated locations allow 
designers to choose a resistor that minimizes parasitics for their particular circuit. 

Typically, a CMOS process offers both low and high value polysilicon resistors. The 
low value resistors are formed by silicided polysilicon film, ffigh value poly resistors are 
fabricated from unsilicided polysilicon by blocking the silicide formation from the polysil- 
icon films. Since the resistance of polysilicon-silicided (polycide) resistors tends to be very 
low (5-15 C2/sq) and the voltage coefficient tends to be relatively high (100-600 ppm/V), 
there is a strong tendency to use unsilicided (or silicide-blocked) resistors. In a typical 
silicide-blocked resistor, the center of the device is silicide-blocked and the end portions 
are left unblocked (see Figure 2.7). Thus, the end portions either receive the conventional 
silicide processing for a contact pad or receive specific optimized processing procedures 
for certain applications. The silicide-blocking layer is usually an oxide or nitride and is 
frequently chosen to leverage a preexisting layer elsewhere in the process. Existence of 
a silicide-blocking layer also enables devices such as silicide-blocked diffusion resistors 
and silicide-blocked MOS devices. 

Polysilicon resistors are usually placed on a field oxide. In technologies with thin 
field oxides (such as local oxidation of silicon (LOCOS)), there is significant electrical 
interaction through the field oxide and parasitic capacitances as well as depletion of the 
bottom of the resistor, which produces a voltage-dependent resistance change. All these 
effects must be considered in the resistor design. However, such effects are significantly 
reduced with the thicker oxides (3000-6000 A), characteristic of STI processes, which 
are used in more advanced technologies. It should be noted that the sheet resistance, as 
well as the thermal and voltage coefficients of silicide-blocked polysilicon resistors, are 
very process-dependent. Implant conditions, grain boundary size, thermal activation, and 
end-portion silicide quality can all impact key polysilicon resistor parameters. Therefore, 
the reported values for the parameters of poly resistors vary widely. 

Metal thin film resistors can be built at any of the traditional metal layers. In addition, 
TaN thin film is frequently used as a precise thin film resistor owing to its easy availability 
in a Cu-damascene process where it is used as a Cu-diffusion barrier or even as an 
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Figure 2.7 Cross section of an unsilicided poly resistor 
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aluminum back end (the TaN film is deposited from reactively sputtered Ta in an N 2 
ambient). The parasitic capacitance of a TaN resistor fabricated in the back-end-of-the- 
line (BEOL) is significantly reduced relative to a polysilicon resistor. Another reason 
why TaN is attractive to both the process developers and circuit designers is because it 
exhibits a temperature coefficient of resistance (TCR)-versus-resistivity relationship that 
ranges from roughly 500 ppm/C at 50 f2/sq to roughly —500 ppm/C at 400 O/sq and is 
attributed to the transition from metallic conduction (positive TCR) to hopping conduction 
(negative TCR). 

(5) High-Q inductor fabrication 

Inductors are critical components in analog/mixed-signal design. Small-valued, precise, 
high- <2 inductors are employed in circuits such as RF transceivers. Larger, lower- Q 
devices have functions such as impedance matching and gain control. Significant research 
has been done on monolithic integration of inductors (Burghartz el al. 1996), and in recent 
years there has been increasing use of inductors in state-of-the-art CMOS processes. 
Spiral inductors (see Figure 2.8) can be fabricated with a conventional MOS process with 
negligible modifications to the design rules. A minimum of two metal layers is required, 
one to form the spiral and the other to form the underpass. 

To minimize parasitic capacitance to the substrate, the top metal layer is the usual 
choice for the main spiral. The most critical factor in inductor design is the optimization 
of the inductor Q at the design frequency. Q, or the “quality factor,” is the ratio of the 
imaginary to the real part of the impedance ( Q — Im(Z)/Re(Z)) and represents the ratio 
of the useful magnetic stored energy over the average dissipation for one cycle of the 
signal propagation. Note that determining the geometry and area required to deliver an 
optimized Q at the design frequency is not a straightforward process. The most difficult 
factor in inductor process design is minimization of the impact of parasitic elements. 
Real inductors have parasitic resistance and capacitance. The parasitic resistance dissi- 
pates energy through ohmic loss, while the parasitic capacitance stores unwanted energy. 
At high frequencies, the skin effect causes a nonuniform current distribution in the metal 




Figure 2.8 Illustration of a spiral inductor 
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segments, which introduces (among other factors) a frequency-dependent contribution to 
the parasitic resistance. Furthermore, electromagnetic effects caused by the Faraday effect 
introduce parasitic currents (eddy currents) in the silicon as well, adding an additional 
frequency dependency in the resistance. The parasitic resistance is primarily driven by 
ohmic resistive losses in the thin patterned metal layers. Parasitic resistance can be mod- 
ulated both by design (trading off inductor area for inductor line width) and by process 
(increasing the thickness of the metal and/or improving a Cu-damascene polish process 
to minimize dishing and thus permit wider metal lines). The capacitive-induced loss is 
driven both by the C ox between the inductor and the substrate and by the lossy properties 
of the substrate. (At high frequencies the current flows through C ox and into the lossy 
substrate. The resulting dissipation adds a real component to the imaginary inductive 
impedance and degrades the Q.) Minimizing this capacitance typically means separating 
the inductor as far as possible from the lossy silicon (usually by placing the inductor 
in the top metal layer). Recent advancements in low-k processes for digital CMOS also 
carry significant benefit (up to 4X improvement in Q for SiLK* compared to conventional 
oxide ILD.) Minimizing the substrate loss is more complex. As the frequency increases 
to where the skin depth is on the order of the substrate thickness, eddy currents in the 
substrate become a major loss mechanism. (This magnetically induced loss can be thought 
of as transformer action between a lossy primary and a lossy secondary.) Mitigating eddy 
current loss can be quite difficult. There are a number of potential techniques including 
solid and patterned ground shields, multilevel metallizations to build vertical solenoids, as 
well as minimizing doping levels under the inductor. Note that since the eddy current loss 
is approximately proportional to the cube of the inductor diameter, strategies to minimize 
resistive parasitics by making large inductors (as is common in GaAs) are less effective 
in CMOS owing to the more conductive Si substrates. Fabrication of high-Q inductors in 
CMOS technology is a challenging effort. 
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3 

RF Modeling 



3.1 INTRODUCTION 

Advances in CMOS fabrications have resulted in deep submicron transistors with higher 
transit frequencies and lower noise figures. Radio-frequency (RF) designers have already 
started to explore the use of CMOS technology in RF circuits. This advanced performance 
of MOSFETs is attractive for high-frequency (HF) circuit design in view of a system- 
on-a-chip realization, where digital, mixed-signal base-band and FIF transceiver blocks 
would be integrated on a single chip. Besides the ability to integrate RF circuits with 
other analog and logic circuits with the intention of reducing the cost by eliminating the 
sometimes expensive packaging, other advantages offered by silicon CMOS technologies 
are also interesting, such as the low cost due to the volume of wafers processed and 
the low power consumption feature of MOSFETs, which makes it suitable for portable 
applications. 

To have an efficient design environment, design tools with accurate models for devices 
and interconnect parasitics are essential. It has been known that for analog and RF appli- 
cations the accuracy of circuit simulations is strongly determined by the device models. 
Accurate device models become crucial to correctly predict the circuit performance. 

In most of the commercially available circuit simulators, the MOS transistor models 
have originally been developed for digital and low-frequency analog circuit design (see, 
for example, Cheng et at. (1997a) and MOS9 Manual (2001)), which focus on the DC 
drain current, conductances, and intrinsic charge/capacitance behavior up to the mega- 
hertz range. However, as the operating frequency increases into the gigahertz range, the 
importance of the extrinsic components rivals that of the intrinsic counterparts. Therefore, 
an RF model with the consideration of the HF behavior of both intrinsic and extrinsic 
components in MOSFETs is extremely important for achieving accurate and predictive 
results in the simulation of a designed circuit. 

So far, most compact MOSFET models do not include the gate resistance Rq. However, 
the thermal noise contributed by the gate resistance should be considered as MOS tran- 
sistors approach gigahertz frequencies, and the resistive and capacitive (RC) effects at the 
gate should be well modeled since both of these effects are important in designing radio- 
frequency CMOS circuits. As shown in Figure 3.1, the gate resistance component will 
significantly affect the input admittance at RF, so a model without R G cannot accurately 
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Figure 3.1 The model without the gate resistance cannot predict the measured Y n characteristics. 
Reproduced from Cheng (2002b) MOSFET Modeling for RF IC Design, in CMOS RF Modeling, 
Characterization and Applications, Jamal D. M. and Fjeldly T. A., eds., World Scientific Publish- 
ing, Singapore 




Frequency (GHz) 

Figure 3.2 The model without the substrate resistances cannot predict measured Y 2 2 character- 
istics. Reproduced from Cheng (2002b) MOSFET Modeling for RF IC Design, in CMOS RF 
Modeling, Characterization and Applications, Jamal D. M. and Fjeldly T. A., eds.. World Scientific 
Publishing, Singapore 

predict the HF characteristics of the device. It is very crucial because one may use this 
resistance for impedance matching to achieve maximum power transfer. Also, the thermal 
noise introduced by the gate resistance increases the noise figure of the transistor. It is 
an important noise source to be considered when optimizing the noise performance of an 
RF circuit. Furthermore, the gate resistance also reduces / max (the frequency at which the 
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maximum available power gain of the device equals to 1), which is an important device 
parameter in RF circuit design in addition to / T , the frequency at which the current gain 
of the device equals to 1. 

Another important component that almost all of the compact models implemented 
in commercial circuit simulators do not account for is the substrate resistance. Actually, 
substrate-coupling effects through the drain and source junctions and these substrate resis- 
tance components play an important role in the contribution to the output admittance, so 
the inclusion of these substrate components in an RF model is needed. This effective 
admittance of the substrate network can contribute 50% of the total output admittance 
(see Jen et al. (1998)). As shown in Figure 3.2, a MOSFET model without the substrate 
resistance components cannot predict the frequency dependency of the output admittance 
of the device, so the simulation with such a model will give misleading simulation results 
of the output admittance when the device operation frequency is in the gigahertz range. 



3.2 EQUIVALENT CIRCUIT REPRESENTATION 
OF MOS TRANSISTORS 

As shown in Figure 3.3, a four terminal MOSFET can be divided into two portions: 
intrinsic part and extrinsic part. The extrinsic part consists of all the parasitic compo- 
nents, such as the gate resistance Rq, gate/source overlap capacitance Cgso> gate/drain 
overlap capacitance Cgdo. gate/bulk overlap capacitance Cgbo, source series resistance 
R s , drain series resistance R D , source/bulk junction diode D s B , drain/bulk junction diode 
D db , and substrate resistances R SB , Rdb> and Rdsb- The intrinsic part is the core of the 
device without including those parasitics. Even though it would be desirable to design 
and fabricate MOSFETs without those parasitics, they cannot be avoided in reality. Some 
of them may be not noticeable in DC and low-frequency operation. However, they will 
influence significantly the device performance at HF. 

Equivalent circuits (ECs) have been an effective approach to analyze the electrical 
behavior of a device by representing the important components. In this section, we discuss 




Figure 3.3 A MOSFET schematic cross section with the parasitic components. Reproduced from 
Cheng et al. (2000b) MOSFET modeling for RF circuit design. Proceedings of the 2000 Third IEEE 
International Caracas Conference on Devices, Circuits and Systems , D23/1-D23/8 
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the ECs for both the intrinsic device (without the parasitics) and the extrinsic device with 
various parasitic components. 

For an intrinsic device, AC small-signal currents referring to the source of the device 
can be calculated by the following: 



*Gi 




J£t>C GGi —jto>C GDi — ymCoBi 




VGSi 


IDi 
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Gm — jtoC DGi Gds + j(oC DDi G m b — j 0)C DBi 




iDSi 
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— jteC BGi —jcoCn Di jcoCn Bi 




^BSi 



where u G si, udsg and i'bsi are the AC voltages at the intrinsic gate, at the intrinsic drain, 
and at the intrinsic bulk (all referring to the intrinsic source); / Gi , i Di , and / Bj are the 
alternating currents (AC) through the intrinsic gate, through the intrinsic drain, and through 
the intrinsic bulk; G,„, Gds, and G m b are the transconductance, channel conductance, and 
bulk transconductance of the device, respectively; C Ay ; are intrinsic capacitances between 
the terminals with the following definitions: 

9 Gy 

Cxy = — - — when r/j, (3.2) 

OVy 

dQ x 

C xy — when x = y. (3.3) 

dv y 

Equation (3.1) can be rewritten as the following: 



*Gi = jo)C GGi^GSi — ./A>C GD jU DS j ~ ./A>C GBi U B Si (3.4) 

( Di = G m V GSi + Gos^DSi + G m bUBSi — iwCDGi^GSi + ./ CnCoDi t^DSi — i^CDBi^BSi (3-5) 
f Bi = —j(oCB Gi^GSi — ./cuCBDilDSi + JCuCBBilBSi- (3.6) 

In Eq. (3.1), we assume that the components between the gate and the other terminals 
can be considered as purely capacitive with infinite resistance, so the gate current in 
Eq. (3.4) does not contain any conductive current component. Similarly, the components 
between the bulk and the other terminals can be also considered as purely capacitive with 
infinite resistance, so the bulk current in Eq. (3.6) does not contain any conductive current 
component. Those assumptions can usually hold for an intrinsic MOSFET because of the 
very low leakage currents through the gate to other terminals and through the bulk to 
other terminals in a MOSFET fabricated with current advanced technology. 

To derive an EC from the above equations, we rearrange the above equations in the 
following forms: 



*Gi = jO)Cc si^GSi + j 0 )C GDi^GDi + j 0 )C GBi^GBi, (3.7) 

*Di = (G,„ — j(i>C ln )v QSi + j n^C G Di UoGi + (G,„b — j( 0 C m b)VBSi 

+ 7<wCBDi ^DBi + (GdS + J<wCsDi)l l DSi> (3.8) 

( Bi = jo)C m gb^GBi + yTuGBSi^BSi + j ^C GB j U BG j + ./ 6jC B pj U B Di (3.9) 

where C m , C m b, and C mg b are the differences of the transcapacitances between the drain 
and the gate, between the drain and the bulk, and between the gate and the bulk, and are 
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given by 

C m = C DGi - C GDi , (3.10) 

C mb = C DBi - C BDi , (3.11) 

C m gb = C G Bi — Cb G i- (3.12) 

Csm and Cusi are intrinsic transcapacitances between the source and the drain, and between 
the bulk and the source, and have the following relationships with other capacitances: 

CsDi = Cf)Di — CBDi ~ C G Di, (3.13) 

CflSi = CfiBi — Cb G i — CBDi- (3.14) 

According to Eqs. (3.7)-(3.9), an EC referring to the source can be derived as shown 
in Figure 3.4, in which several current components contributed by the transcapacitances 
are included in the EC. As shown in Figure 3.3, parasitic capacitances such as the over- 
laps of gate-to-source/drain/bulk and the junction capacitances from the source/bulk and 
drain/bulk diodes are not negligible in a MOSFET and must be included in the EC to 




Figure 3.4 An equivalent circuit for an intrinsic MOSFET. Reproduced from Cheng (2002b) 
MOSFET Modeling for RF IC Design, in CMOS RF Modeling, Characterization and Applications, 
Jamal D. M. and Fjeldly T. A., eds., World Scientific Publishing, Singapore 
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describe the device behavior at HF. With the inclusion of those extrinsic capacitances, 
the EC for a MOSFET can be given in Figure 3.4, that is, 



CgS = Cosi + CgSo, 


(3.15) 


CgD = CoDi + CgDo, 


(3.16) 


Cob = OtBi + CgBo> 


(3.17) 


CbS = CBSi + CjBS. 


(3.18) 


CbD = CflDi + CjBD- 


(3.19) 



In a MOSFET model for DC and low-frequency applications, the parasitic resistances 
at the gate and substrate can be ignored with little influence on the simulation accuracy. 
Usually, the parasitic resistances at the source and drain can be treated as “virtual” com- 
ponents by incorporating them in the 7-V equation to account for the influence of the 
voltage drops at those resistances (see Cheng et al. (1997b)). At HF, however, these par- 
asitic resistances will influence the device performance significantly and they all should 
be modeled and included in the EC for the device. 

The gate resistance is in principle a bias-independent component at DC and low fre- 
quency, but may contain the contribution of an additional component with bias dependence 
at HF, as discussed by Jin et al. (1998) and Cheng et al. (2001a). The parasitic resistances 
at the source and drain consist of several parts as we will discuss later and can be also 
treated as bias-independent components even though they do have some bias dependence 
depending on the device structure and process conditions. The resistances in the substrate 
can be modeled by different EC networks, such as five-resistor network, four-resistor 
network proposed by Liu et al. (1997), three-resistor network proposed by Cheng et al. 
(1998), two-resistor network by Ou et al. (1998), and one-resistor network by Tin et al. 
(1999), as shown in Figures 3.5 to 3.9. The four- and five-resistor networks are more 
accurate and can be valid up to higher frequency, but the analysis and parameter extrac- 
tion of the components are very complex. The one- and two-resistor networks introduce 
fewer components and are easier for the analysis and parameter extraction. However, 
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Figure 3.5 Five-resistor substrate network 
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Figure 3.6 Four-resistor substrate network. Reproduced from Liu W. et al. (1997) R.F.MOSFET 
modeling accounting for distributed substrate and channel resistances with emphasis on the BSIM3v3 
SPICE model, Tech. Dig. IEDM, 309-312 
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Figure 3.7 Three-resistor substrate network. Reproduced from Cheng Y. (1998) RF modeling 
issues of deep-submicron MOS-FETs for circuit design, Proc. of the IEEE International Conference 
on Solid-State and Integrated Circuit Technology, pp. 416-419 

they may be less accurate when the operating frequency is increased. The three-resistor 
network is a compromise among these substrate networks. It can ensure the accuracy 
up to 10 GHz while maintaining a simple analysis and parameter extraction. However, 
it should be pointed out that the intrinsic bulk has been shifted to the end of 7?dsb- as 
shown in Figure 3.7, instead of located somewhere along the resistor /?dsb- It has been 
concluded that this approximation does not influence much the simulation accuracy (see 
Enz and Cheng (2000)). 

With further consideration of parasitic resistances at the drain, at the gate, at the source, 
and at the substrate, a complete lumped EC for a MOSFET at HF can be constructed and 
is shown in Figure 3.10. 
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Figure 3.8 Two-resistor substrate network. Reproduced from Ou J.-J. et al. (1998) CMOS RF 
modeling for GHz communication IC’s, Proc. of the VLSI Symposium on Technology, pp. 94. 95 
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Figure 3.9 One-resistor substrate network. Reproduced from Tin S. F. et al. (1999) Substrate net- 
work modeling for CMOS RF circuit simulation, Proc. IEEE Custom Integrated Circuits Conference, 
pp. 583-586 

The EC shown in Figure 3.10 can be used to understand and analyze the HF behavior 
of a MOSFET. In order to implement this EC in a SPICE simulator, a subcircuit approach 
has to be used. 

In the subcircuit, the characteristics of the intrinsic device is described by a MOS 
transistor compact model implemented in the circuit simulator, and all the extrinsic 
components have to be located outside the intrinsic device, so that the MOS transis- 
tor symbol in the subcircuit only represents the intrinsic part of the device 1 . For example, 
(1) the source and drain series resistors are added outside the MOS intrinsic device to 

1 It may include the overlap capacitances at the source, at the drain, and at the bulk, depending on the intrinsic 
compact MOSFET model used in the implementation. 
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Figure 3.10 An equivalent circuit with both intrinsic and extrinsic components. Reproduced from 
Cheng (2002b) MOSFET Modeling for RF IC Design, in CMOS RF Modeling, Characterization 
and Applications , Jamal D. M. and Fjeldly T. A., eds., World Scientific Publishing, Singapore 



make them visible in AC simulation (in most compact models, since the internal series 
resistances are only “virtual” resistances embedded in the I-V model to account for the 
DC voltage drop across the source and drain resistances in calculating the drain current, 
they do not add any poles and are therefore invisible for AC simulation); (2) the gate resis- 
tance is added to the subcircuit model (usually Rq is not part of the MOS compact model, 
but plays a fundamental role in RF circuits as we discussed in Section 3.1); (3) the sub- 
strate resistors are added to account for the signal coupling through the substrate; (4) two 
external diodes are added in order to account for the influence of the substrate resistance 
at HF (the source-to-bulk and drain-to-bulk diodes are part of the compact model but their 
anodes are connected to the same substrate node, which will short the AC signal at FIF, 
see Liu et al. (1997), so the diodes internal to the compact model should be turned off). 
With the above considerations, a subcircuit that represents an RF MOSFET in a circuit 
simulator can be defined and is shown in Figure 3.1 1. Note that the intrinsic substrate node 
should be connected at some point along the resistor Rdsb- but simulations have shown 
that connecting the intrinsic substrate to the source or the drain side has little influence 
on the simulated AC parameters. In some RF models (see, for example, Enz and Cheng 
(2000)), the intrinsic substrate has been connected to the source side in order to save one 
node and one component for the subcircuit model. Two external overlap capacitances, 
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Figure 3.11 A subcircuit that can be implemented in a circuit simulator. Reproduced from Cheng 
(2002a) High frequency small signal AC and noise modeling of MOSFETs for RF IC design, IEEE 
Trans. Electron Devices, 49(3), 400-408 

Cqsp and Cqdp as shown in Figure 3.11, with bias dependence can be added but this 
is not always required, depending on the compact model used. For example, BSIM3v3 
accounts for bias-dependent overlap capacitances that, if extracted correctly, have shown 
a sufficient accuracy. However, by adding these external capacitances, the inaccuracies of 
the intrinsic capacitance model appearing for short-channel devices can be corrected. In 
the next section, we will discuss the modeling of these intrinsic and extrinsic components 
shown in Figure 3.10. 



3.3 HIGH-FREQUENCY BEHAVIOR OF MOS 
TRANSISTORS AND AC SMALL-SIGNAL 
MODELING 

Compared with the MOSFET models for both digital and analog applications at low 
frequency, compact models for HF applications are more difficult to develop owing to 
the additional requirements of bias dependence and geometry scaling of the parasitic 
components as well as the requirements of accurate prediction of the distortion and noise 
behavior. A common modeling approach for RF applications is to build subcircuits based 
on the intrinsic MOSFET that has been modeled well for analog applications (see, for 
example, Liu et al. (1997), Cheng et al. (1998), and Enz and Cheng (2000)). The accuracy 
of such a model depends on how to establish subcircuits with the correct understanding of 
the device physics in HF operation, how to model the HF behavior of intrinsic devices and 
extrinsic parasitics, and how to extract parameters appropriately for the elements of the 
subcircuit. A reliable and physics-based parameter extraction methodology based on the 
appropriate characterization techniques is another important portion of the RF modeling 
to determine the model parameters and generate scaleable models for circuit optimization. 

Currently, most RF modeling activities focus on the above subcircuit approach based 
on different compact MOSFET models developed for digital and low-frequency analog 
applications, such as EKV (Enz et al. 1995), MOS9 (MOS9 Manual 2001), and BSIM3v3 
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(Cheng et al. (1997b)). Several MOSFET models for RF applications have been reported 
(see, for example, Liu et al. (1997), Ou et al. (1998), Pehlke et al. (1998), Cheng et al. 
(1998), and Enz and Cheng (2000)). With added parasitic components at the gate, at the 
source, at the drain, and at the substrate, these models can reasonably well predict the 
HF AC small-signal characteristics of short-channel (<0.5 p,m) devices up to lOGFIz. 
However, the RF MOSFET modeling is still at a preliminary stage compared with the 
modeling work for digital and low-frequency analog applications. Efforts from both indus- 
try and universities are needed to bring RF MOSFET models to a mature level in further 
improving the RF models in describing the AC characteristics more accurately, and in 
improving the prediction of noise characteristics, distortion behavior, and non-quasi-static 
(NQS) behavior. 



3.3.1 Requirements for MOSFET Modeling 
for RF Applications 

Compared with the MOSFET modeling for digital and low-frequency analog applications, 
the HF modeling of MOSFETs is more challenging. All the requirements for a MOSFET 
model in low-frequency application, such as continuity, accuracy, and scalability of the 
DC and capacitance models, should be maintained in an RF model (see Cheng and Hu 
(1999)). In addition, there are further important requirements to the RF models: 

1. The model should accurately predict bias dependence of small-signal parameters at 
HF operation. 

2. The model should correctly describe the nonlinear behavior of the devices in order to 
permit accurate simulation of intermodulation distortion and high-speed large-signal 
operation. 

3. The model should correctly and accurately predict HF noise, which is important for 
the design of, for example, low noise amplifiers (LNAs). 

4. The model should include the NQS effect, so it can describe the device behavior at 
very high-frequency range in which NQS effect cannot be ignored for a model to 
behave correctly and will degrade the device performance significantly. 

5. The components in the developed EC model should be physics-based and geometrically 
scaleable so that the model can be used in predictive and statistical modeling for RF 
applications. 

To achieve the above, the model for the intrinsic device should be derived with the 
inclusions of most (if not all) important physical effects in a modem MOSFET, such 
as normal and reverse short-channel and narrow width effects, channel-length modu- 
lation. drain-induced barrier lowering (DIBL), velocity saturation, mobility degradation 
due to vertical electric field, impact ionization, band-to-band tunneling, polysilicon deple- 
tion, velocity overshoot, self-heating, and channel quantization. Also, the continuities of 
small-signal parameters such as transconductance G m , channel conductance Gds, and the 
intrinsic transcapacitances must be modeled properly. Many MOSFET models, including 
MOS9, EKV, and BSIM3v3 have been developed for digital, analog, and mixed-signal 
applications. Recently, they all are extended for use in RF applications. 
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3.3.2 Modeling of the Intrinsic Components 

Compact models including many mathematical equations for different physical mecha- 
nisms have been discussed in other chapters in this book. It has been found that the 
model accuracy in fittings of HF small-signal parameters and large-signal distortion of an 
RF MOSFET is basically determined by the DC and capacitance models. Here we only 
give a brief discussion on important modeling concepts without getting into the detailed 
equation derivation and physics analysis. 

As a must for the backbone of the model, the electric field, the channel charge, and the 
mobility need to be modeled carefully to describe the current characteristics accurately 
and physically, on the basis of which, different physical effects can be added in the model. 

In modeling the channel charge, physical effects such as short-channel effect, narrow 
width effect, nonuniform doping effect, quantization effect, and so on should be accounted 
for in order to describe the charge characteristics accurately in today’s devices. There are 
two types of charge models: one can be called threshold-voltage ( Vth)-based models and 
the other can be called surface-potential ( \jr s )-based models (Boothroyd et al. 1991). i// v - 
based charge models are based on the analysis of the surface potential that will appear 
in the I-V model to describe charge characteristics with the influence of many physical 
effects. V lh -based charge models are derived also by solving the surface potential with the 
consideration of those physical effects, but finally V, h is used instead of \jr s in the charge 
(and hence I-V) model to account for the influence of some process parameters such as 
oxide thickness and doping and device parameters such as channel length and width. In 
both models, the continuities of the charge and its derivatives should be modeled carefully 
for the I-V model to have good continuity and to predict correct distortion behavior of 
the devices (Langevelde and Klaassen 1997). 

Mobility is another key parameter in MOSFET modeling. It will influence the accuracy 
and distortion behavior of the model significantly. The relationship between the carrier 
mobility and the electric field in MOSFETs has been well studied (see, for example, Liang 
et al. (1986) and Chen et al. (1996)). Three scattering mechanisms have been proposed 
to describe the dependence of mobility on the electric field. Each mechanism may be 
dominant under specific conditions of doping concentration, temperature, and biases as 
shown in Figure 3.12. 

It has been realized that an accurate and physical description of a mobility model in 
compact MOSFET RF models for circuit simulation is essential for distortion analysis. It is 
also suggested that different models for electron and hole mobilities should be developed 
because of the difference in quantum-mechanical behavior of electrons and holes in the 
inversion layer in today’s MOSFETs as discussed by Langevelde and Klaassen (1997). 

On the basis of the charge and mobility models, complete I-V equations can be devel- 
oped with further inclusions of many important physical effects such as short-channel and 
narrow width effects, velocity saturation and overshoot, poly-depletion effect, quantiza- 
tion effect, and so on. In order to meet the requirements for both AC small-signal and 
larger-signal applications, the continuity and distortion behavior of the I-V model should 
be ensured in deriving the equations when including these physical effects. 

In real circuit operation, the device operates under time-varying terminal voltages. 
Depending on the magnitude of the time-varying signals, the dynamic operation can be 
classified as large-signal operation and small-signal operation. Both types of dynamic 
operation are influenced by the capacitive effects of the device. 
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Figure 3.12 Mobility behavior influenced by different scattering mechanisms, depending on the 
bias and temperature conditions. Reproduced from Takagi S. et al. (1994) On the universality of 
inversion layer mobility in Si MOSFET's: part I - effects of substrate impurity concentration, IEEE 
Trans. Electron Devices, ED-41, 2357-2362 



Many MOSFET intrinsic capacitance models have been developed. Basically, they 
can be categorized into two groups: (1) Meyer and Meyer-like capacitance models (see, 
for example, Meyer (1971)) and (2) charge-based capacitance models (see Sheu et al. 
(1984)). The advantages and shortcomings of the two groups of models have been well 
discussed and both of them have been implemented in circuit simulators. The Meyer and 
Meyer-like models are simpler than the charge-based models, so they are efficient and 
faster in computations. But they assume that the capacitances in the intrinsic MOSFET 
are reciprocal, which is not the case in real devices (see the discussion by Cheng and Flu 
(1999)), and earlier models based on this assumption cannot ensure charge conservation 
(see Yang et al. (1983)). Charge-based models ensure charge conservation and consider 
the nonreciprocal property of the capacitances in a MOSFET. These features are required 
to describe the capacitive effects in a MOSFET, especially for RF applications in which 
the influence of transcapacitances are critical and should be considered in the model. But 
usually the charge-based capacitance models require complex equations to describe all of 
the 16 capacitances in a MOSFET with four terminals, as given in the following: 



r dQi 
' J dVa 



i^j i, j = G, D, S, B, 



Cij - 



dQi . . 

1 = 7 

dVij 



(3.20) 



(3.21) 





82 RF MODELING 



The development of an intrinsic capacitance model of modem MOSFETs is another 
challenging issue in RF modeling. To meet the needs in RF applications, besides ensuring 
charge conservation and nonreciprocity, an intrinsic MOSFET capacitance model should 
at least have the following features: (1) guaranteeing model continuity and smoothness 
in all the bias regions, (2) providing model accuracy for devices with different geometry 
and different bias conditions, and (3) ensuring model symmetry at Eds = 0 V. 

Some comparisons between the MOSFET capacitance models and the measured data 
have been reported (see, for example. Ward (1981)). However, a complete verification of 
the bias and geometry dependencies of those capacitance models has not been seen. It has 
been found that some engineering approaches have to be used to improve the accuracy of 
the capacitance model if the intrinsic capacitance model cannot describe the device behav- 
ior accurately. Recently, the model continuity has been improved greatly. Many discon- 
tinuity issues in earlier capacitance models have been fixed. However, most capacitance 
models still cannot ensure the model symmetry when Vds = 0. In Figures 3.13 and 3.14, 
the asymmetries of the capacitance model in BSIM3v3 are shown for Cgs = Cgd, Cdd and 
Css and for Cbd and Cbs (see, for example, Cheng and Hu (1999)). It has been known that 
a MOSFET should be symmetric for some capacitances at Eds = 0, that is, Cdd = Css 
and Cbd = Cbs- The asymmetric issue in the capacitance model is apparently nonphysi- 
cal and may cause convergence and accuracy problem in the simulation. This issue may 
become more critical in the model for RF applications because the devices are often biased 
in the region of Eds ^ 0 V in some applications such as switching. Efforts have been made 
based on the source-referenced approach, the bulk-referenced approach, and the surface 
potential-oriented approaches to improve the symmetry property of the models (see Tsi- 
vidis (1987)). The development of advanced capacitance models with good continuity, 
symmetry, accuracy, and scalability is still a challenge for the model developers. 




Figure 3.13 Simulated Css and Cdd as a function of Eds- Css / Cdd when Eds = 0- Repro- 
duced from Cheng and Hu (1999) MOSFET Modeling & BSIM3 User’s Guide , Kluwer Academic 
Publishers, Norwell, MA 
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Figure 3.14 Simulated Css and Cdd as a function of Vbs- Cbs / Cbd when tbs = 0. Repro- 
duced from Cheng and Hu (1999) MOSFET Modeling & BSIM3 User’s Guide, Kluwer Academic 
Publishers, Norwell, MA 



3.3.3 HF Behavior and Modeling of the Extrinsic Components 

For an AC small-signal model at RF, the understanding and the modeling of parasitics 
are very important. The models for these parasitic components should be physics-based 
and linked to process and geometry information to ensure the scalability and prediction 
capabilities of the model. Also, simple subcircuits are preferred to reduce the simulation 
time and to make parameter extraction easier. 

Besides the development of a physical and accurate intrinsic model discussed above, 
the following issues should be considered in developing a MOSFET model for deep 
submicron RF applications: 

1. The gate resistance should be modeled and included in the simulation. 

2. The extrinsic source and drain resistances should be modeled as real external resistors, 
instead of only a correction to the drain current with a virtual component. 

3. Substrate coupling in a MOSFET, that is, the contribution of substrate resistance, needs 
to be modeled physically and accurately using appropriate substrate network for the 
model to be used in RF applications. 

4. A bias-dependent overlap capacitance model, which accurately describes the parasitic 
capacitive contributions between the gate and the drain/source, needs to be included. 



3.3.3. 1 High-frequency behavior and modeling of gate resistance 

At DC and low frequency, the gate resistance consists mainly of the polysilicon sheet 
resistance. The typical sheet resistance for a polysilicon gate ranges between 20 and 
40 C/sq, and can be reduced by a factor of 10 with a silicide process, and even more 
with a metal stack process. At HF, however, two additional physical effects appear, which 
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will affect the value of the effective gate resistance. One is the distributed transmission 
line effect on the gate and the other one is the distributed effect or NQS effect in the 
channel (see Jin et al. (1998) and Cheng et al. (2001a)). Both theoretical analysis with 
the consideration of these two HF effects and detailed experimental characterization are 
needed to obtain an accurate and physical gate resistance model, which is critical in 
predicting the HF behavior of the MOSFETs in designing an RF circuit. 

In Figure 3.15, it is shown that R G decreases first as channel length L { increases while 
showing a weak bias dependence in this region, then starts to increase with Lf as Lf 
continues to increase above 0.4 |xm while showing a strong bias dependence. The Lf 
dependence of Rq varies for different V G s- At lower V G s, the Lf dependence of R G is 
stronger. Also, Rq for the devices with longer L f increases significantly and has stronger 
V GS dependence. Figure 3.16 shows the per-finger-channel-width Wf dependence of R G . 
It demonstrates that R G increases as Wf decreases when Wf < 6 |xm, and the device with 
the same Wf has higher R G at lower V G s, which becomes more obvious when Wf narrows. 
Figures 3.17 and 3.18 give R G for devices with various geometries at several Vds, from 
which we observe similar Lf and Wf dependencies of R G as what we found in Figures 3.15 
and 3.16. However, the Vds dependence of R G becomes very weak when Vds is larger 
than 1 V. 

The U-shape L f dependence of R G in Figure 3.15 can be explained with the consid- 
eration of the distributed gate effect (DGE) and the NQS effect, in RF MOSFETs. It has 
been well known that the resistance of a polysilicon resistor, simulating the polysilicon 
gate in a MOSFET, is 3 or 12 times smaller (depending on the layout) at HF due to the 
DGE than that at DC but still scales with Wf/Lf. It has also been known that the NQS 
effect occurs in a MOSFET operated at HF in which the carriers in the channel cannot 
respond to the signal immediately. Thus, there is a finite channel transit time for the dis- 
tributed effect of the carrier transportation in the channel due to the varying gate signal. 
In that case, the signal applied to the gate suffers an additional equivalent gate resistance, 




Figure 3.15 Curves of R a versus L t at different Vqs- The dotted lines illustrate approximately the 
dependence of /fc.poiy portion on 1/Lf and the dependence of R a nqs portion on Lf, respectively. 
Reproduced from Cheng Y. et al. (2001a) High frequency characterization of gate resistance in RF 
MOSFETs, IEEE Electron Device Lett., 22(2), 98-100 
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Figure 3.16 Curves of R<, versus Wf at different Vq S . The dotted lines illustrate approximately 
the dependence of /? G ,poiy on W f and the dependence of R a nqs on 1/VTf, respectively. Reproduced 
from Cheng Y. et al. (2001a) High frequency characterization of gate resistance in RF MOSFETs, 
IEEE Electron Device Lett., 22(2), 98-100 
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Figure 3.17 Curves of Rq versus Lf at different Vbs- Reproduced from Cheng Y. et al. (2001a) 
High frequency characterization of gate resistance in RF MOSFETs, IEEE Electron Device Lett., 
22(2), 98-100 



which is proportional to L { /Wf, from the distributed channel resistance, which adds to the 
contribution from the poly-gate resistance. In other words, R G consists of two parts: the 
7? G ,poiy contributed by the poly-gate resistance and the /?f r . „ qs due to NQS effect. As NQS 
effect becomes more significant, the contribution of i? G ,nqs dominates. This is the case in 
devices with longer L { . Thus, we can understand the irregular geometrical dependence 
of Rq in Figure 3.15 when L f is longer than 0.4 |im. It has been shown that the channel 
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Figure 3.18 Curves of R a versus W( at different Vb s . Reproduced from Cheng Y. et al. (2001a) 
High frequency characterization of gate resistance in RF MOSFETs, IEEE Electron Device Lett., 
22(2), 98-100 

transit time for the NQS effect is roughly inversely proportional to ( Vgs — Vh), where V* 
is the threshold voltage of the device, and proportional to Ljr (see Tsividis (1987)). Thus, 
^o.nqs (and hence Rq) is higher in a device with longer L f and at lower Vgs- As shown in 
Figure 3.15, the NQS effect has begun to influence Rq values in devices with relatively 
short L f at RF. However, when L f is short enough, the contribution of Rq nqs is smaller 
and l?G,poiy is dominant. In that case, Rq becomes larger as L f tends to be shorter. 

Similarly, we can understand the Wf dependence of Rq in Figure 3.16. As discussed 
above, the Rq portion from the distributed polysilicon gate, /b,, P oiy, is proportional to 
IVf/Lf; however, the Rq portion from the distributed channel, /fc.nqs, is proportional to 
Lf/Wf. As Wf becomes narrower, Rq, nqs becomes higher, so it may dominate the total 
Rq when Wf reduces to some value, say 6 pm in Figure 3.16. As Wf becomes wider, 
^G.poiy becomes higher, so it may dominate the total Rq when Wf becomes larger than 
some specific value. Thus, as Wf changes, there exists a minimum R G at some point of 
Wf, as demonstrated in Figure 3.16. 

The stronger Vgs dependence of Rq in the narrow Wf region of Figure 3.16 can be 
understood because R G .„ qs with strong bias dependence is dominant in the narrower device, 
while Ro.poiy without bias dependence plays a bigger role in the longer Wf region. 

The Vds dependence of Rq shown in Figures 3.17 and 3.18 can also be explained. It is 
known that the distributed effect at the gate is independent of Vds and the distributed effect 
in the channel is stronger in the saturation region than in the linear region. Thus, according 
to the above analysis, we should have lower Rq nqs (and hence Rq) at Vds = 0.5 V, at 
which the device operates in the triode (or linear) region, than Vds = 1 V, at which the 
device operates in the saturation region, for the device with the same Wf and Lf. When 
Vds is higher than 1 V, the device remains in saturation and the channel conductance (and 
hence the NQS effect) does not change much as Vds increases, so Rq is insensitive to 
Vds (>1 V) in Figures 3.17 and 3.18. 

The distributed transmission line effect on the gate at HF has been studied (see, for 
example, Liu and Chang (1999)). It will become more severe as the gate width becomes 
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wider at higher operation frequency. So multifinger devices (if they have wide channel 
widths) are used in the circuit design with narrow gate width for each finger to reduce 
the influence of this effect. A simple expression of gate resistance, Rq, based on that in 
DC or low frequency has been used to calculate the value of gate resistance with the 
influence of the DGE at HF. Flowever, a factor of a is introduced, which is 1/3 or 1/12 
depending on the layout structures of the gate connection to account for the distributed 
RC effects at RF, as given in the following: 

= + v)- l3 - 22) 

In Eq. (3.22) Rash is the gate sheet resistance, W , is the channel width per finger, Lf is the 
channel length, N t is the number of fingers, and W eKt is the extension of the polysilicon 
gate over the active region. 

Complex numerical models for the gate delay have been proposed by Abou-Allam 
and Manku (1997). However, the simple gate resistance model with the a factor for the 
distributed effect has been found accurate up to 1/2 fj for a MOSFET without significant 
NQS effects as discussed by Enz and Cheng (2000). 

For the devices with NQS effects, additional bias and geometry dependences of the 
gate resistance are needed to account for the NQS effect. It has been proposed that 
an additional resistive component in the gate should be added to represent the channel 
distributed RC effect, which can be “seen” by the signal applied to the gate, as shown in 
Figure 3.19. Thus, the effective gate resistance Rq consists of two parts: 

Rq = /^G.poly + //( j . ::q, (3.23) 

where Rc, p( ,\ y is the distributed gate electrode resistance from the polysilicon gate material 
and is given by Eq. (3.22) and Rq nqs is the NQS distributed channel resistance seen from 
the gate and is a function of both biases and geometry. 

Efficient and accurate modeling of the NQS effect in MOSFETs is very challenging. An 
R g model with the consideration of the NQS effect has been reported (see, for example, 




], 

Figure 3.19 Equivalent gate resistance consists of the contributions from the distributed gate poly 
resistance and distributed channel resistance 
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Jin et al. (1998)). However, the following simple expression can be used to obtain the 
^G.nqs approximately in the strong inversion regime: 

tfo.nqs = p- (3.24) 

where G m is the transconductance of the device and ft is a fitting parameter with a typical 
value around 0.2. 



3.3. 3. 2 Modeling of source and drain resistances 

The total source and drain series resistances in a MOSFET used in Integrated Circuit 
(IC) designs have several components such as the via resistance, the salicide resistance, 
the salicide-to-salicide contact resistance, and the sheet resistance in the LDD region, 
as shown in Figure 3.20. However, the contact and the LDD sheet resistances usually 
dominate the total resistance. The typical value of the sheet resistance is around 1 kf2/sq 
in the LDD region for a typical 0.25-p.m CMOS technology and much smaller in more 
advanced technologies. 

It has been known that the source/drain resistances are bias-dependent. In some compact 
models such as BSIM3v3 (Cheng etal. (1997b)), these bias dependencies are included. 
However, since these parasitic resistances in BSIM3v3 are treated only as virtual compo- 
nents in the I -V expressions to account for the DC voltage drop across these resistances, 
they are invisible to the signal in the AC simulation. Therefore, external components 
for these series resistances need to be added outside an intrinsic model to accurately 
describe the HL noise characteristics and the AC input impedance of the device. Typi- 
cally, the source/drain resistances Rn and Rs without including any bias dependence can 
be described by 

(3.25) 

(3.26) 
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Figure 3.20 An illustration of the components of the source/drain series resistance 
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where r dw and r sw are the parasitic drain and source resistances with unit width and 
i?oo and A’so account for the part of the series resistances without the width dependence. 
Equation (3.25) can work reasonably well in today’s MOSFETs for RF applications, 
because the LDD region in these devices with advanced technologies (0.18 p,m and less) 
has a very high doping concentration. Thus, the bias dependence of R D and R s becomes 
weaker compared with devices with longer channel lengths and lighter LDD doping 
concentrations in the older technology generation. 



3. 3. 3.3 HF behavior and modeling of substrate resistance 

Usually, the location of the substrate tie to ground the MOSFETs in low-frequency analog 
IC design is not regular and can be put in any suitable place in a layout of a circuit. This 
makes the substrate resistance a function of the distance between the active device and the 
substrate contacts besides the active device size and substrate contact shape. The values 
of the substrate resistance may be different for devices in different locations even though 
they have the same channel length and width. In low-frequency analog design, accurate 
evaluation of the substrate resistance is not required because the influence of the substrate 
resistance can be ignored. However, in RF ICs, the substrate resistance can contribute 
significantly to the device behavior, mainly output admittance, and accurate prediction of 
the substrate resistance becomes important. 

Although it is always desirable to have a detailed distributed RC network to account 
for the contribution of the substrate components, it is too complex to be implemented in 
a compact model. Some tools using three-dimensional or quasi-three-dimensional numer- 
ical approaches to simulate the effects of the substrate resistance are available; however, 
a proper integration of such a tool into the design system remains an issue. Also, the 
accuracy of the simulation results is dependent on the accuracy of the process infor- 
mation provided by process simulation, which needs to be calibrated carefully (very 
time-consuming) to obtain the desirable accuracy. Thus, a good compromise is to use 
simplified lumped RC network, which is be accurate in required operation frequency 
range, to simulate the contribution of the substrate components. 

It has been known that the contribution of the substrate resistance R su b, which provides 
an AC path to the signal and influences the output admittance T 22 behavior, cannot be 
ignored at radio frequency (RF). An RF model without including the substrate resistance 
will be 20% or more off the measured data of the T 2 2 characteristic of a MOSFET. This 
is not desirable in RF IC design because an accurate prediction of T 2 2 is very important 
in designing a matching network to compensate the overall gain over a wide frequency 
range. Also, accurate substrate resistance is required in a power amplifier design to eval- 
uate the overall power loss properly, and in an LNA design to predict the noise figure 
without underestimating the contribution of the substrate resistance. Recently, RF models 
with the substrate components, which include the substrate resistances and drain/source 
junction capacitance, have been published. However, detailed characterization of these 
substrate components in MOSFETs at RF has not been reported yet. It was expected 
that the substrate resistances may be bias-dependent due to the variations of the deple- 
tion regions below the gate and surrounding the source and drain diffusions. The HF 
experimental exploration of these substrate components in MOSFETs is very important 
to help the understanding of the device behavior and the modeling of the MOSFET 
at RF. 
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To accurately evaluate the influence of the substrate resistance in RF IC design, device 
structures with their own substrate contacts are preferred. This device design may also 
help to reduce the cross-talk caused by the substrate coupling between the devices. Its 
disadvantage is that it will take more space owing to the substrate contacts for each 
device in the circuits. However, it may be acceptable for RF IC because of the small 
device amounts in RF circuits compared to other circuits, say, the digital IC. It elimi- 
nates the geometry uncertainty caused by the irregular substrate contact design so that 
the designers can predict the contribution of the substrate resistance accurately with the 
developed model. 

Before we discuss the model for substrate components, we first discuss the measured 
HF behavior of the substrate network including the substrate resistance R sub and junc- 
tion capacitance CjDB- The details of the extraction of the substrate components will 
be discussed in Section 3.4.2. As we mentioned earlier, the substrate components will 
mainly influence the Y 2 2 characteristics at HF. Figures 3.21 and 3.22 give the measured 
Y 2 2 behavior versus frequency at various gate and drain-bias conditions. Both real and 
imaginary parts of Y 22 show strong bias dependence on both gate and drain biases. Further, 
the characteristics of R sub , CjDB, and Cgd versus frequency at different gate and drain 
biases that can be extracted from measured Y 22 according to the procedures to be dis- 
cussed later are shown in Figures 3.23 to 3.26. In Figures 3.23 and 3.24, a weak gate-bias 
dependence of both substrate resistance and junction capacitance can be observed. It is 
understandable according to the device structures, and demonstrates that the modulation 
of the gate bias to the channel depletion layer does not influence the substrate resis- 
tance significantly. A strong gate-bias dependence of Cgd is understandable without any 
surprise. Keeping in mind the weak gate-bias dependence of the substrate resistance, the 
strong gate-bias dependence of Re{T 22 ) shown in Figure 3.22 is mainly contributed by the 
channel resistance Rds- The bias dependence of gate-to-drain capacitance Cgd does not 




Figure 3.21 Measured Y 22 data at different drain biases show obvious drain-bias dependence. 
Reproduced from Cheng et al. (2000a) On the high frequency characteristics of the substrate resis- 
tance in RF MOSFETs, IEEE Electron Device Lett., 21(12), 604-606 
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Figure 3.22 Measured Y 22 data at different gate biases show strong gate-bias dependence. Repro- 
duced from Cheng et al. (2000a) On the high frequency characteristics of the substrate resistance 
in RF MOSFETs, IEEE Electron Device Lett., 21(12), 604-606 




Figure 3.23 Extracted Cgd and Cjdb capacitances at various gate -bias conditions 



influence the bias dependence of Rc{ Y 12 ) significantly; however, it is believed that Cgd 
causes the strong gate-bias dependence of Im{ Y 22 ) shown in Figure 3.22. The I m { Y su \ y [ 
data will show a weak gate-bias dependence after de-embedding the contribution of Cgd 
from the measured Y 22 data. The weak gate-bias dependence of junction capacitance, as 
shown in Figure 3.22, is consistent with the measurement results performed in low or 
medium frequency range. 
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Figure 3.24 Extracted substrate resistance shows a very weak gate-bias dependence. Reproduced 
from Cheng Y. et al. (2000) On the high frequency characteristics of the substrate resistance in RF 
MOSFETs, IEEE Electron Device Lett., 21(12), 604-606 




Figure 3.25 Extracted Cgb and Cjdb capacitances at various drain-bias conditions 

However, the obvious drain-bias dependence of Re[ K 22 }, shown in Figure 3.21, is 
believed to be caused by the contribution of both channel resistance and junction capaci- 
tance. The data still shows a dependence on drain bias after de-embedding the influence of 
7 ?ds. Rg, Q;d. and so on from Rc{ K 2 2 } - After further removing the influence of junction 
capacitance, the data (representing the substrate resistance) shows a weak dependence on 
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Frequency (GHz) 

Figure 3.26 Extracted substrate resistance shows a very weak drain-bias dependence. Reproduced 
from Cheng et al. (2000a) On the high frequency characteristics of the substrate resistance in RF 
MOSFETs, IEEE Electron Device Lett., 21(12), 604-606 

drain bias as demonstrated in Figure 3.26. However, unlike the case of varying the gate 
bias, the drain-bias dependence of 1 m ( Y 22 \ is mainly due to the existence of the junction 
capacitance instead of Cqd- Much stronger drain-bias dependence of the junction capaci- 
tance CjDB than of Cgd has been found as shown in Figure 3.25. It is also consistent with 
the measurement results of junction capacitances at low and medium frequency. 

A simple EC for the substrate network shown in Figure 3.7 has been used to analyze 
the HF substrate-coupling effect and the characteristics of substrate resistance at HF (see, 
for example, Cheng et al. (2000a)). Even though a simpler substrate network has been 
reported by Tin et al. (1999), it is found that the three-resistor substrate network can 
ensure better model accuracy over a wider frequency range. 

Figure 3.27 illustrates a lumped RC EC for the substrate components in a multifinger 
device with substrate ties residing at both sides of the device. Noting that all the source 
(or drain) terminals for different fingers are connected together, and the source terminal 
is grounded together with the substrate terminal, a simplified substrate network, as shown 
in Figure 3.7, with the following relationships can be obtained: 



N s 



CsB = C s b,k, 


(3.27) 


k= 1 




Yd 




Cdb = QbA 1 


(3.28) 


k= 1 




1 Ns 1 
1 =y 1 , 

-KsB Rsb.k 

k= 1 


(3.29) 
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Figure 3.27 Illustration of the equivalent circuit (EC) for substrate components in a multifinger 
device. Reproduced from Cheng et al. (2002c) Parameter extraction of accurate and scaleable 
substrate resistance components in RF MOSFETs, IEEE Electron Device Lett., 23(4), 221-223 
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( 3 . 31 ) 



where Csb and Cdb are the total source-to-bulk and drain-to-bulk capacitances, CsB.tr 
and CDB.tr are the source-to-bulk and drain-to-bulk capacitances of each source and drain 
region in the multifinger device, N s and /V d are the numbers of the source and drain 
regions, R s B , Rdb, and Rusb are the total equivalent resistances between the source and 
the substrate, between the drain and the substrate, and between the source and the drain 
underneath the channel in the substrate, Rsb.k, Rdb.k, and Rdsr.k are the resistances, 
corresponding to each single source/drain. 

Assuming no difference between the outer and the inner source/drain regions, we have 



Csb = N s C sb , k , 



Cdb = AdC dW , 



^DSB = 



Rdsb.kLf 

NfWf 



( 3 . 32 ) 

( 3 . 33 ) 

( 3 . 34 ) 



where L f and W ( are the channel length and the width per finger. Rd sb . k is the sheet 
resistance in the substrate underneath the channel between the source and drain in a 
single-finger device. 

Noting that the value of the substrate resistance from the outer finger subdevice is 
much smaller than that from the inner finger device, and also noting that the device is 
symmetric, we have the following: 



1 ~ 1 + 1 

^SB ^sb.l ^sb.Ns 
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and 

1 ~ 1 + 1 

RdB ^db,l ^db,Nd 

Also, according to the layout, the following equations have been used: 

(3.35) 

(3.36) 

where r dbw and r sbw are the substrate resistances with unit-channel width. 

Generally, assuming that the device is symmetric with respect to the source and the 
drain and that it has no difference between the outer and the inner source/drain regions 
in a multifinger device, we have 



R 



DB 



R 



SB 



''dbw 

"wT’ 

Gbw 



Rdsb — 



r dsb^-f 

NfWf 



(3.37) 



where r dsb is the sheet resistance in the substrate between the source and the drain. 

Some bias dependence of the substrate resistances had been expected, on the basis 
of the fact that the depletion regions below the gate and surrounding the source and 
drain diffusions may vary at different gate and drain-bias conditions. However, it has 
been found that the bias dependence of the substrate resistances is actually very weak 
for the devices with substrate ties isolated by shallow trench from the active region, and 
the above simple substrate resistance network is accurate up to 10 GHz, as discussed by 
Cheng et al. (2000a). 



3.3.3.4 High-frequency behavior and modeling of parasitic capacitances 

It has been known that the gate capacitance can be directly extracted from the measured 
Y -parameters as discussed by Cheng et al. (2002). Figure 3.28 shows that the imaginary 
part of Y\\ and Yu, which can be used to extract the Cgg and the Cqd as we will discuss 
in Section 3.4.2. The bias dependence of the Y — parameters (the gate capacitance) is 
obvious as the gate bias varies. Strong drain-bias dependence has also been expected for 
the extracted capacitance data versus the gate and drain biases. 

As shown in Figure 3.29, the parasitic capacitances in a MOSFET can be divided 
into the following components: (1) the outer fringing capacitance between the poly silicon 
gate and the source/drain. Cpo; (2) the inner fringing capacitance between the poly sil- 
icon gate and the source/drain, Cpi; (3) the overlap capacitances between the gate and 
the heavily doped S/D regions (and the bulk region), Cgso and Cgdo(CgboX which are 
relatively insensitive to terminal voltages; (4) the overlap capacitances between the gate 
and the lightly doped S/D region, Cgsol and Conor , which change with biases; (5) the 
source/drain junction capacitances, Cjd and Cjs; and (6) the substrate capacitance, Csub- 
Most of them have been included in models for digital/analog applications (see the dis- 
cussion by Cheng and Hu (1999)). However, additional parasitic capacitance components 
may have to be added to the existing models (either intrinsic or extrinsic capacitance 
models) if they cannot meet the accuracy requirements at RF. 
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Figure 3.28 Measured imaginary part of Y\\ and Y 12 at different bias conditions 



G 




/7-sub 



Figure 3.29 Illustration of different capacitance components in a MOSFET 

In Figures 3.30 and 3.31, the capacitances Cgdp and Cgsp obtained from the total 
capacitances extracted from the measured ^-parameters and the intrinsic capacitances 
simulated with the model are shown. The definitions of Cgdp and Cgsp are given in the 
following: 



CgDP — CGDtotal_extracted CGDintrinsic_simulated> 

Cgsp = CGStotal_extracted CGSintrinsic_simulated> 



(3.38) 

(3.39) 
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Figure 3.30 An example to show the bias dependence of extracted equivalent extrinsic capacitance 
between the gate and the source from measured HF data at different drain biases. Reproduced from 
Cheng (2002b) MOSFET Modeling for RF IC Design, in CMOS RF Modeling, Characterization 
and Applications , Jamal D. M. and Fjeldly T. A., eds., World Scientific Publishing, Singapore 



A f = 10 





Figure 3.31 An example to show the bias dependence of extracted equivalent extrinsic capacitance 
between the gate and the drain from measured HF data at different drain biases. Reproduced from 
Cheng (2002b) MOSFET Modeling for RF IC Design, in CMOS RF Modeling, Characterization 
arid Applications, Jamal D. M. and Fjeldly T. A., eds., World Scientific Publishing, Singapore 
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where CoDtotai_extracted is the total Cqd capacitance extracted from the measured data, 
! 'g i )int n r. sic simulated is the intrinsic Cqd simulated by the model, CGstotai_extracted is the total 
Cos capacitance extracted from the measured data, and Cosintrinsic^imuiated is the intrinsic 
Cgs simulated by the model. 

According to the definition of Cgdp and Cgsp, we can consider these capacitances 
as overlap capacitances if the intrinsic capacitance model is accurate enough. However, 
in some cases, Cgdp and Cgsp should not be considered as overlap capacitances since 
they may contain the correction to the intrinsic capacitances if the intrinsic capacitances 
are not properly modeled. It is clear that Cgdp and Cgsp have strong bias dependences 
that cannot be fitted by a constant overlap capacitance model. To improve the overall 
RF model accuracy, an engineering approach, adding additional capacitance components 
with bias dependence for Cgdp and Cgsp in the subcircuit, can be used if the capacitance 
model in a RF model cannot provide good accuracy over different bias regions. 

The substrate capacitance is another extrinsic capacitance that should be considered 
in a subcircuit model for ultra-HF applications. In the above substrate RC network, we 
did not include the contribution of the substrate capacitance. It does not influence the 
model accuracy to fit the measured data up to 10 GHz. However, the substrate capacitance 
component may be necessary in a subcircuit model when the device operates at frequencies 
much higher than 10 GHz. 



3.3.4 Non-quasi-static Behavior 

Figure 3.32 shows the characteristics of I?Gsh,HF (= RoNfLf/Wf) extracted for devices 
with different L f . R G sh,cai in the figure is the measured DC gate sheet resistance but 
divided by 3 to consider the distributed effect at HF and is a constant value independent 
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Figure 3.32 Gate sheet resistance ^osh.HF versus frequency for devices with different channel 
lengths. Higher value of /fcsh.HF is obtained at HF compared with RGsh,cai- Reproduced from 
Cheng et al. (2001b) Frequency dependent resistive and capacitive components in RF MOSFETs, 
IEEE Electron Devices Lett., 22(7), 333-335 
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Figure 3.33 Effective unit-area gate capacitance CoG.unit versus frequency for devices with differ- 
ent channel lengths. The value of Coo.unit is reduced for the device with longer L f , and also CoG.unit 
is not a constant as the frequency varies in the devices with strong NQS effect. Reproduced from 
Cheng et al. (2001b) Frequency dependent resistive and capacitive components in RF MOSFETs, 
IEEE Electron Devices Lett., 22(7), 333-335 



of the device geometry. However, the measurements show that the /?Gsh,HF not only is 
larger than the ifcsh.cai (even for the device with an Lf of 0.35 |inr) but also increases as 
Lf increases. For the device with an Lf of 1.35 p,m, the frequency dependency of 7?Gsh.HF 
is obvious, which is in contradiction to what we have seen in low and intermediate 
frequencies for the components in a MOSFET. Figure 3.33 shows the characteristics of 
CGG,unit(= CQa/{WfLfNf}) versus frequency for devices with different channel lengths. 
As observed, the extracted CoG.unit shows some weak frequency dependency for the 
device with an Lf of 0.35 |xm but can still be considered approximately constant over the 
frequency range. This is consistent with the results in low and intermediate frequencies. 
However, for the devices with longer Lf, the value of CGG.unit is smaller compared with 
the device with shorter Lf at the same operation frequency. Furthermore, CoG.unit is not 
a constant any more with frequency and decreases as the frequency increases, which is 
significant in the devices with the longest channels. Figure 3.34 shows that the normalized 
G m (— Rc( F 2 1 )/ ReOLiifOi) degrades seriously in devices with longer L f as the frequency 
increases, where / 0 is a fixed frequency. The frequency dependency of G m in the device 
with an L f of 0.35 |inr is weak; however, it becomes very strong in the device with an 
Lf of 1.35 |xm (see Cheng et al. (2001b)). 

As mentioned earlier, the NQS effect results in a signal delay or even a malfunc- 
tion of the circuits in some cases when a MOSFET operates at HF as discussed by Oh 
et al. (1980). Typically we can see this NQS effect in a device with an L of 10|xm 
at about 1 MHz as discussed by Paulous et al. (1983). However, it is expected that the 
critical channel length ( L c ) for NQS effect to happen will decrease as the signal fre- 
quency increases. 

When the device cannot respond to the signal immediately, the distributed effect of 
the channel resistance should be accounted for. This distributed effect in the channel 
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Frequency (Hz) 

Figure 3.34 Normalized equivalent transconductance versus frequency for devices with different 
Lf. The degradation of G,„ can be explained with the existence of NQS effect. Reproduced from 
Cheng et al. (2001b) Frequency dependent resistive and capacitive components in RF MOSFETs, 
IEEE Electron Devices Lett., 22(7), 333-335 



or NQS effect will cause an increase in the effective gate resistance as discussed by 
Jin et al. (1998) and Cheng et al. (2001a). So it can be understood that in Figure 3.32 
because of the existence of the NQS effect the extracted HF gate sheet resistance, L Gs h.HF, 
is higher than /?Gsh,cai> a theoretically estimated value for gate sheet resistance where only 
the distributed effects on the gate are considered. Because the influence of the NQS effect 
can be ignored at low frequency but increases significantly as frequency increases, the 
extracted effective gate sheet resistance, /?gs 1 i,hf> exhibits strong frequency dependency. 
Similarly, the frequency dependency of /(gsh.hf in a 0.35-p.m device is not obvious 
because of the weak NQS effect in this device, but becomes stronger as L f increases. 
Thus, larger J?Gsh,HF and stronger frequency dependency of L Gs h,HF are found in devices 
with larger Lf. 

It is also known that the NQS effect will equivalently introduce a transcapacitance 
between the drain and the gate (see, for example, Cheng et al. (2001a)). The displacement 
current from this additional capacitance (referred to as C nqs ) can cancel partially the output 
current, which is equivalent to an increased delay to the signal. C nqs is negative relative 
to the positive gate-to-source, gate-to-drain, and gate-to-bulk capacitances Cqs, Cod, and 
Cgb, so the effective Cgg with NQS (the sum of Cgs, C’gd, Cqb, and C nqs ) is less than 
that without NQS (the sum of Cos • Cqd> and Cqb only). In devices with longer Lf, the 
NQS effect is stronger, so |C nqs | is larger and hence Cgg is smaller. Also, as frequency 
increases, the NQS effect in the device is stronger, so |C nqs | increases and Cgg decreases. 
Thus, a frequency dependency of Cqg can be seen in Figure 3.33 owing to the existence 
of the component C nqs . 

The degradation of G m at HF has been considered as an important phenomenon that 
should be accurately modeled to predict the circuit behavior at HF (see, for example, 
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Tsividis and Suyama (1993)). The reason for the degradation of G m is considered as the 
contribution of the NQS effect even though it may be partially caused by the increased 
signal “feed-through” via Cdg at HF 2 . 



3.4 MODEL PARAMETER EXTRACTION 

3.4.1 RF Measurement and De-embedding Techniques 

For a model to describe the device characteristics accurately, all important model parame- 
ters should be extracted from measured data. To extract the RF model parameters, on-chip 
HF measurements are performed by using specifically designed test structures. Also, a 
de-embedding methodology has to be developed to remove the influence of the para- 
sitics in the test structure from the measured raw data in order to obtain the data for the 
characteristics of the device-under-test (DUT). 

Figure 3.35 illustrates the setup of an HF measurement system for on-wafer RF mea- 
surements. A controller is used to send the commands to instruments (vector network 
analyzer (VNA) and I—V tester, etc.) and the probe station to perform the measurements 
for a specific DUT and to gather the measured data for postprocessing. To ensure the 
accuracy of the measurements, a system calibration has to be performed before con- 
ducting any measurements on the DUT. Typically, the system calibration for on-wafer 
measurements is done by using a so-called impedance standard substrate (ISS) that can 
provide high-accuracy and low-loss standards for two-port calibration procedures such 
as short-open-load-through (SOLT) and through-reflect-line (TRL). The SOLT calibration 
has been widely used because it is supported by virtually every VNA. However, TRL 
calibration is the most fundamental of the advanced calibrations and requires the least 
amount of information about the standards. Only VNAs with advanced calibration capa- 
bilities will support the TRL calibrations. ISS calibration can ensure reasonable accuracy 
if the substrate and the interconnect losses of the DUT are comparable to those of the ISS. 
Recently, however, it has been discussed that additional de-embedding of substrate para- 
sitics in RF CMOS devices may be needed because of the high substrate losses compared 
with other devices such as GaAs MESFETs. 




Figure 3.35 Equipment requirement of an HF measurement system 



2 Because the existence of Cdg provides a signal path, more and more signals are fed back through this 
capacitance as frequency increases, so the total output current (and hence the transconductance) is reduced. 
However, it can only explain partial G,„ degradation. 
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Besides the system calibration discussed above, de-embedding methodology for raw 
data measured from the DUT has also to be developed on the basis of specific test 
structures designed according to de-embedding techniques. Figure 3.36(a), (b), and (c) 
show the test structures for the so-called two-step de-embedding procedure. Figure 3.36(a) 
illustrates the test structure with the DUT. The pads for port 1 and port 2 are signal pads 
connecting the gate and the drain terminal of the DUT and the top and the bottom 
ground pads connect to both source and substrate of the DUT, as illustrated further in 
Figure 3.37. This test structure is used for ^-parameter measurements of two-port systems. 
Test structures for multiport systems (more than two ports) can be designed and measured 
also. But the measurement system with specific design consideration of the probe tips and 





Figure 3.36 Illustrations of the test structures for a two-step calibration of 5-parameter measure- 
ments: (a) test structure with the DUT; (b) open test structure; and (c) short test structure 





Figure 3.37 Illustration of on-wafer HF measurement for a two-port system 











MODEL PARAMETER EXTRACTION 103 



calibration techniques should be used. Also, the de-embedding technique of the raw data 
is more complex than that measured from a two-port system. 

Figure 3.36(b) is the so-called “open" structure for a two-port measurement. It uses 
the same test structure as in Figure 3.36(a) but the DUT has been removed, so all the 
pads are open without any connections between them. Figure 3.36(c) shows the so-called 
“short" structure that is the opposite of the “open" structure in which all of the pads are 
shorted to each other. 

Different de-embedding techniques have been developed on the basis of different cali- 
bration test structures (see discussion by Koolen et al. (1991) and Chen and Deen (2001)). 
Here, the de-embedding procedure based on the open and short calibration test struc- 
tures, illustrated in Figure 3.36, is discussed as an example. This two-step de-embedding 
technique has been widely used in HF measurements for different technologies. 

Typically, a DUT with parasitics from the test structures can be represented by the 
equivalent circuit in Figure 3.38, where Tpi, Ip 2 , and 7p3 represent the influence of the 
parallel parasitics and Zsi, Z S 2 , and Z S 3 describe the influence of the series parasitics. 

The parallel elements Tpi, Tp 2 , and Fp 3 can be obtained from the measured data of the 
open structure, that is, 



Fp 3 = — 7i2,open = — ^21, open, (3.40) 

ipi = ^11, open + ^12, open i (3.41) 

Tp 2 = 722, open + ^21, open- (3.42) 

The series elements Z s i, Z s2 , and Z s3 can be obtained from the measured data of both 
open and short structures, that is, 



Z s i + z s3 z s3 

Zs3-Z s 2 + Z S 3 



( f^shoit fupen ) 



(3.43) 



The measured data corresponding to the transistor can be obtained according to the 
following equation 



^transistor — [(Idut 



1 open/ 



( 7 s hort ' 



W-l-l 



1 open > 



(3.44) 




S/B S/B 



Figure 3.38 Equivalent circuit used for two-step de-embedding of measured HF data of MOSFETs 
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Thus, according to the above, the procedures of the two-step de-embedding technique 
can be given as follows: 

1. Measure the s -parameters (Sdut, .V ope n , and 5 S hort) for DUT, open and short test struc- 
tures and convert them to T -parameters (Tdut. Topen, and T S hort)- 

2. Perform the first step de-embedding by removing the parallel parasitics from both Tdut 
and T S hort according to the following equations: 



Zduti — Tout — T open , 

T,hort 1 — T s hort fopen- 



(3.45) 

(3.46) 



3. Perform the second de-embedding by removing the series parasitics Z s | lort i , converting 
from T short i, from Z DU ti, and from T DUT i, according to the following equation: 



■^transistor — Zduti — ■^shortl • (3.47) 

Figures 3.39 to 3.42 show the data of the measured Y\\ and Y 22 before and after 1 step 
and 2 step de-embedding. Significant difference between the data before and after 1 step 
de-embedding has been observed. Thus, the data de-embedding with the open calibration 
structure is absolutely necessary to extract accurate parameters of an RF model. A minor 
difference between the data after the 1-step and the 2-step de-embedding indicates that 
the calibration with the short structure may be ignored for the MOSFETs at a frequency 
range up to 10 GHz. However, for the device to work at a much higher frequency range, 
the importance of the calibration with the short structure should be considered. Also, the 
short calibration may have to be used to obtain the measured data for other devices such 




0 2 4 6 8 10 12 

Frequency (GHz) 

Figure 3.39 Illustration of the necessity of the de-embedding of the real part of the measured 
Tn data. Reproduced from Cheng (2002b) MOSFET Modeling for RF IC Design, in CMOS RF 
Modeling, Characterization and Applications, Jamal D. M. and Fjeldly T. A., eds.. World Scientific 
Publishing, Singapore 
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Figure 3.40 Illustration of the necessity of the de-embedding of the imaginary part of measured 
I'll. Reproduced from Cheng (2002b) MOSFET Modeling for RF IC Design, in CMOS RF Mod- 
eling, Characterization and Applications, Jamal D. M. and Fjeldly T. A., eds., World Scientific 
Publishing, Singapore 




Figure 3.41 Another example to show the importance of the de-embedding of the real part of 
measured 122- Reproduced from Cheng (2002b) MOSFET Modeling for RF IC Design, in CMOS RF 
Modeling, Characterization and Applications, Jamal D. M. and Fjeldly T. A., eds.. World Scientific 
Publishing, Singapore 
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Figure 3.42 The figure shows a significant difference between the imaginary part of the measured 
Y 2 2 before and after de-embedding. Reproduced from Cheng (2002b) MOSFET Modeling for RF IC 
Design, in CMOS RF Modeling, Characterization and Applications, Jamal D. M. and Fjeldly T. A., 
eds.. World Scientific Publishing, Singapore 

as inductors because the devices themselves are very sensitive to the influence of the 
series parasitics. 



3.4.2 Parameter Extraction 

Depending on the EC used in the model, methodologies of HF parameter extraction have 
been developed (see, for example, Jen et al. (1998) and Kolding (2000)). In the previous 
section, we have discussed the EC of a MOSFET for RF applications. Usually, the 7- 
parameter analysis of the EC is adopted to obtain the necessary equations to extract the 
values of some resistive and capacitive components. It has been known that the poles due 
to the terminal resistances (that usually are small because of the large finger numbers) 
are at a much higher frequency than typical transit frequencies, so that they basically can 
be neglected when calculating the 7-parameters and the related quantities. 

The substrate resistances in the small-signal circuit of Figure 3.10 are also neglected 
when analyzing the 7-parameters (7n, 7 12 , and 7 2 i, except 7 22 ) to obtain expressions 
that are suitable for use in parameter extraction. 

The parameters related to the DC characteristics are extracted with the data from the 
DC measurements. The methodologies for the DC model parameter extraction have been 
well developed (see, for example, Cheng et al. (1997b)) and they are not discussed here. 
Next we will focus on the discussion of the extraction of the AC parameters for the 
components shown in Figure 3.10. 

The EC given in Figure 3.10 contains too many components, especially current sources, 
which make the 7-parameter analysis very complex and difficult if not impossible, to 
obtain any useful analytical expressions for the parameter extraction. In order to extract 
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the AC parameters, the influence from the intrinsic components has to be minimized. By 
considering the transistor biased in the strong inversion mode with V DS = 0 V, the intrinsic 
behavior of the transistor becomes symmetric in terms of the drain and the source ter- 
minals. Therefore, the effects of the transconductances and the transcapacitances become 
very small and can be neglected, that is, G m ~ 0, G m b ~ 0, C,„ ~ 0, C m b ~ 0, Csd ~ 0, 
and the small-signal EC in Figure 3.10 can be simplified to that shown in Figure 3.43, 
where /?ds = 1/Gds- 

By applying a gate bias high enough to operate the device in strong inversion regime, 
the intrinsic gate-to-bulk capacitance Cgb is small enough and can be neglected. The 
EC for the Y\\ parameter analysis is obtained, as shown in Figure 3.43, by shorting the 
output port and neglecting Cgb in Figure 3.10. Since the transistor is operating in the 
linear region with Vds = 0, Cgs is approximately equal to Cqd- The structure and the 
equivalent effects of the circuit are fully symmetric, which makes the effect of A 1 ns very 
small so that it can then be neglected. Further, the following assumptions have been 
adopted in the 7-parameter analysis of the equivalent circuit in Figure 3.10: 

1. Rq, and Rn are dominated by the contributions from the resistance of poly sili- 
con and diffusion layers and are treated as parameters independent of bias condition 
and frequency. 

2. The equivalent impedance from the intrinsic source/drain nodes to the external 
source/drain nodes are dominated by the terminal resistances R s and R □ , that is. 




B 



Figure 3.43 An equivalent circuit used for extracting the HF model parameters 
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3. The frequency range considered in this analysis is up to 10 GHz, within which the 
following simplifications hold: (coCqs Rs) 2 « 1, (cuCgd^d) 2 <5C 1, co 2 CgsCgd(Rd + 
Rs)Rg « 1, and (1 + jo>C GG R G y 1 1 — jo)C GG R G , where Cgg is the total gate 

capacitance C GG = C G s + C G d + C G b- 



On the basis of the above, the following approximate equations for the Y -parameters can 
be obtained: 



Tn » 


a m 2 (Cq G R g + Cq S Rs + Cq D /?d) + jo)C GG , 


(3.48) 


Fl2 s 


s —° )2 C gg C gg R g — j o>C G \) . 


(3.49) 


T 2 1 s 


a G m ~ orC GG C G u R(, — ja>(C(;\) + G m R G C GG ). 


(3.50) 



Direct extraction of the AC parameters can be performed from the measured data 
according to the above equations, 
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(3.52) 

(3.53) 

(3.54) 

(3.55) 

(3.56) 

(3.57) 



Depending on the measured data, which can be influenced by the design of the test 
structure, the calibration of the measurement system, the experience of the measurement 
person, and the accuracy of the de-embedding procedures, the values of R G and R% 
extracted from the S-parameter measurements may or may not equal the ones extracted 
from DC measurements. To ensure that the DC characteristics predicted by the model 
parameters extracted from DC measurements are not disturbed by the possible different 
A’i) and Rs extracted from the measured 5-parameters, it is recommended that the values 
of Ad and Rs extracted from DC measurements are used in extracting the AC parameters. 
In that case, the R G parameter can be extracted with the following equation: 



Re{Fn}-ft; 2 (C 2 D RD + C 2 s Rs) 
Im{F„} 2 



(3.58) 



To extract the parameters for the substrate network, additional analysis for the F 22 
parameter (Vqs = Vos = 0) is needed. Figure 3.43 gives the equivalent circuit for the 
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device at the given bias conditions. To simplify the analysis, the influence of R D is 
subtracted first from the Z 22 corresponding to the two-port network given by Figure 3.43, 

Z 22 ' = Z 2 2 — Rd- (3.59) 

An FIF small-signal EC of MOSFET is given in Figure 3.44 for the devices at the 
saturation-operating regime. The box surrounded by the dotted line is the RC network for 
the substrate components. According to the EC, it is known that the measured F 22 includes 
at least the contribution from the gate resistance R G , drain series resistance R D , source 
series resistance R s , channel resistance R DS , gate-to-source capacitance Cos, and gate-to- 
drain capacitance Cgd besides the substrate components. To understand the FIF behavior 
of the substrate components, we should either use specific test structures to measure the 
contributions from the substrate components only or de-embed the contributions of these 
components such as R G and Cgd, and so on from the measured fA. Ffere, we adopt the 
latter approach. Next, we present the methodology of de-embedding the measured T 22 to 
obtain the T suh data representing the contribution of substrate components. By performing 
a tedious but straightforward T-parameter analysis for the EC shown in Figure 3.43, we 
finally obtain the following equations: 



Re{Tsub) = Re{y 2 2'} - R G (coC GD ) 2 - (3.60) 

Rds 

Im{T SU bi = - jo>C G D (3.61) 

where y 22 is the T 22 without the influence of R G . T SU b is the output admittance of the 
substrate network in Figure 3.45, co — 2 nf and / is the operation frequency. In the above 
analysis, the contributions of transconductances G m and G,„b are ignored since no obvious 
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Figure 3.44 Equivalent circuit used for Y -parameter analysis to extract the HF model parameters 
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Figure 3.45 A simplified equivalent circuit of the substrate network. Reproduced from Cheng Y. 
et al. (2000) On the high frequency characteristics of the substrate resistance in RF MOSFETs, 
IEEE Electron Device Lett., 21(12), 604-606 



current flows in the channel at the given bias conditions. Also, the influence of R s on the 
total admittance is not taken into account in the analysis. This is reasonable because of the 
dominant contribution of Cqs- Furthermore, the assumptions of or ( Cqs + Cgb) 2 ^g ^ 1 
and (ojCqo) 2 R ( 2 ; <5C 1 are used, which are generally valid in the frequency range up to 
10 GHz. 

The parameters of Cqd and R G can be obtained as discussed earlier. Thus, the T sub data 
de-embedded from the measured T 2 2 data according to the above equations represents the 
contribution of the substrate network. 

To extract the substrate components, such as the substrate resistance and junction 
capacitances, we further derive the following equation by doing a T-parameter analysis 
of the substrate network in Figure 3.45: 

F S ub ~ — ; — ; — — («Cdb)' + jTkCdb = Rsub(wCDB)“ + JtoCoB (3.62) 

Adb + R SB + «dsb 



where 



(wC sb ) 2 ^b 



« 1, 



RdB + ^SB + ^DSB 
These assumptions are valid in 



(o»C S b) 2 ^db^sb ^ 1 

RdB + RsB + ^DSB 
the frequency range up to 



and (wC sb ) 2 ^sb < 1 • 
10 GHz. Therefore, we have 



Cdb 



Im{ Y sub } 
co 



_ Re{T sub } 
Im{T sub } 2 ' 



(3.63) 

(3.64) 



The extracted Cdb includes the contribution of both the intrinsic capacitance Cbdi and 
the drain junction capacitance Cjdb- The Cbdi can be separated from the extracted Cdb 
with the measured data at different V D s because Cjdb is a function of drain bias and 
Cbdi is approximately independent of the drain bias in the saturation regime. However, 
typically the capacitance Cdb is dominated by Cjdb- The value of Cjdb at zero bias can 
be extracted from Eq. (3.63) with the measured data at V D s = 0 V. The parameters to 
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Figure 3.46 Extracted values of Rq, Rs, and Rn at a given bias condition 




Figure 3.47 Extracted values of Cgg. Cgs, and Cgd at a given bias condition 



describe the bias dependence of CjDB can be extracted according to Eq. (3.63) with the 
measured data at different Vbs- 

Figure 3.46 shows the extracted resistances as a function of frequency with the tran- 
sistor at the given bias condition. It is shown that those components are frequency- 
independent. The extracted capacitances versus frequency are shown in Figure 3.47. For 
the given device in the figure, all of the capacitive components are also frequency- 
independent. The substrate resistance can be extracted from the slope of the Re{T sub ) 
versus Im{7 sub } 2 as shown in Figure 3.48. The parameters for R sb , R db , and R dsb can be 
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Imag(7 sub ) 2 ( 1/ohm 2 ) 

Figure 3.48 Illustration of the extraction of R sub from the plot of Re(F sub ) versus Im(F sub ) 2 . Here, 
Re(y sub ) is the real part of F sub and Im(T sub ) is the imaginary part of F sub 




1/Wf (1/pm) 

Figure 3.49 Extracted R sab from devices with different per-finger- widths. r dbw and r sbw can be 
further obtained through the plots of R sub versus 1 / W e g from devices with even- and odd-finger 
numbers. Reproduced from Cheng Y. et al. (2002c) Parameter extraction of accurate and scaleable 
substrate resistance components in RF MOSFETs, IEEE Electron Device Lett., 23(4), 221-223 

extracted further once the values of R sub for devices with different widths and finger 
numbers are obtained. 

Similarly, R sab in devices with different geometry can be obtained. Figure 3.49 gives 
the extracted R sub from the devices with different widths. It can be seen that R suh 
is approximately proportional to W f 1 for devices with the same number of fingers. 
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Figure 3.50 One-resistor EC for the substrate network. Reproduced from Cheng Y. et al. (2002c) 
Parameter extraction of accurate and scaleable substrate resistance components in RF MOSFETs, 
IEEE Electron Device Lett., 23(4), 221-223 

According to the analysis above, the three-resistor substrate network can be further sim- 
plified into the one-resistor network by considering R sub as an equivalent resistance of the 
three discrete resistors as shown in Figure 3.50. Once the R sub for devices with different 
channel widths are determined, the values of the parameters such as r d b W and r S b W can be 
obtained. r sbw is determined first from the obtained 7? sub in odd-finger devices with differ- 
ent channel widths according to Eq. (3.35), and then r dbw is determined from the obtained 
R suh in even-finger devices with different channel widths after obtaining r S b W - Devices 
with different channel lengths (besides different channel widths and fingers) should be 
measured at much higher frequencies than 10 GHz to extract the values of r dsb accurately. 
Here we are interested in frequencies up to 10 GHz, and we use calculated value for r dsb 
from the doping concentration in the substrate region (underneath the channel between the 
source and the drain). Depending on the processing conditions and the substrate material, 
r ds b can be different. 

3.5 NQS MODEL FOR RF APPLICATIONS 

The NQS effect should be included for an RF model to accurately describe the HF 
characteristics of devices if the devices themselves exhibit this effect at the operating 
frequency. Most MOSFET models available in circuit simulators use the quasi-static (QS) 
approximation. In a QS model, the channel charge is assumed to be a unique function of 
the instantaneous biases, that is, the charge responds to a change in voltages with infinite 
speed. Thus, the finite charging time of the carriers in the inversion layer is ignored. In 
reality, the carriers in the channel do not respond to the signal immediately, and hence, 
the channel charge is not a unique function of the instantaneous terminal voltages (quasi- 
static) but a function of the history of the voltages (non-quasi-static). This problem may 
become pronounced in RF applications, where the input signals may have rise or fall times 
comparable to, or even smaller than, the channel transit time. For long channel devices, 
the channel transit time is roughly inversely proportional to (Vgs — V*) and proportional 
to L 2 . Because the carriers in these devices cannot follow the changes of the applied 
signal, the QS models may give inaccurate or anomalous simulation results that cannot 
be used to guide circuit design. 
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The modeling of the frequency-dependent components caused by the NQS effect is 
challenging in compact models for circuit simulation 3 . Owing to the existence of the NQS 
effect, a MOSFET model based on the QS approximation may not accurately describe 
the HF device behavior. 

The NQS effect can be modeled with different approaches for RF applications: (a) Rq 
approach, in which a bias-dependent gate resistance is introduced to account for the dis- 
tributed effects from the channel resistance as discussed earlier (see Jin etal. (1998)); 
(b) A’, approach, in which a resistance A’, (as used in modeling a MESFET (Metal 
Semiconductor Field Effect Transistor) or HEMT (High Electron Mobility Transistor) is 
introduced to account for the NQS effect (see Chen and Deen (1998)); (c) transadmittance 
approach, in which a voltage-control-current-source (VCCS) is connected in parallel to the 
intrinsic capacitances and transconductances to model the NQS effect (see Enz and Cheng 
(2000)); and (d) core model approach, in which the NQS effect can be modeled in the core 
intrinsic model (see Chan etal. (1998) and Cheng etal. (1997b)). It should be pointed 
out that all of these approaches would have to deal with complex implementation issues. 

Both the R(, and the A| approaches will introduce additional resistance components in 
the model besides the existing physical gate and channel resistances measured at DC or 
low frequency, so the noise characteristics of the model using either Rq or A, approach 
need to be examined. Ideally, the NQS effect should be included in the core intrinsic model 
if the model can predict both NQS and noise characteristics without a large penalty in 
the model implementation and simulation efficiency. 




Figure 3.51 Measured and simulated results of Y 2 1 for a MOSFET with 1.35-p.m channel length. 
Model without considering the NQS effect cannot describe the HF device behavior. BSIM3v3 NQS 
model can predict accurately the Y 2 1 characteristics even though the device has strong NQS effect. 
Reproduced from Cheng et al. (2001b) Frequency dependent resistive and capacitive components 
in RF MOSFETs, IEEE Electron Devices Lett., 22(7), 333-335 



3 Because devices with longer Lf have lower fy and strong NQS effect, they usually are not suitable for small- 
signal RF applications. However, devices with longer Lf may be used in circuits such as switch or biasing 
circuits. It is still desirable that an RF model can simulate devices having obvious NQS effects. 
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Figure 3.52 Measured and simulated results of Yn. The fitting of the Y n characteristics of the 
model needs to be improved. Reproduced from Cheng et al. (2001b) Frequency dependent resistive 
and capacitive components in RF MOSFETs, IEEE Electron Devices Lett., 22(7), 333-335 

Some compact models such as BSIM3v3 with the consideration of the NQS effect have 
been verified with measurements for devices at the medium frequency range. Figure 3.51 
shows the simulation results by using the models with and without considering the NQS 
effect. It is clear that the model without the NQS effect cannot predict correctly the 
device behavior in both Y\\ and Y 21 . By including the NQS effect, BSIM3v3 can predict 
the measured data very well in both the real and imaginary parts of >21 ■ However, the 
model needs to be improved for fitting Yn (see Figure 3.52) as discussed by Cheng et al. 
(2001b). The inclusion of the NQS effect would be a desirable feature for an RF model 
even though it remains a question whether the devices in RF circuits for small-signal 
applications will operate in the frequency region at which the devices show significant 
NQS effects. 
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4 

Noise Modeling 



4.1 NOISE SOURCES IN A MOSFET 

Both passive and active components in a circuit will generate various types of noise. To 
understand the noise behavior, a single MOSFET can be considered as a small circuit 
with different resistive, capacitive, and active components as we have seen in the previous 
chapter. Thus different noise sources exist in a MOS transistor as shown in Figure 4. 1 with 
their power spectral densities (PSDs). They include (1) terminal resistance thermal noise at 
the gate, (2) terminal resistance thermal noise at the drain, (3) terminal resistance thermal 
noise at the source, (4) thermal noise and the flicker noise in the channel, (5) substrate 
resistance thermal noise, and (6) induced gate noise. 

In principle, flicker noise is a low-frequency noise and it mainly affects the low- 
frequency performance of the device, so it can be ignored at very high frequency. 
However, the contribution of flicker noise should be considered in designing some radio- 
frequency (RF) circuits such as mixers, oscillators, or frequency dividers that up-convert 
the low-frequency noise to higher frequency and deteriorate the phase noise or the signal- 
to-noise ratio. Channel resistance and all terminal resistances contribute to the thermal 
noise at high frequency (HF), but typically channel resistance dominates in the contri- 
butions of the thermal noise from the resistances in the device. Induced gate noise is 
generated by the capacitive coupling of local noise sources within the channel to the gate, 
and usually it plays a more important role as the operation frequency goes much higher 
than the frequency at which channel thermal noise dominates. 



4.2 FLICKER NOISE MODELING 

Among all noise sources, the flicker noise is the dominant source for phase noise in 
silicon MOSFET circuits, especially in the low-frequency-range. It sets a lower limit on 
the level of signal detection and spectral purity and is one of the factors limiting the 
achievable dynamic range of MOS ICs, so it is important for device and circuit designers 
to minimize this effect in order to improve the circuit performance. As designers begin to 
explore circuits with low-power and low-voltage MOSFETs, the impact of low-frequency 
flicker noise becomes more and more crucial for providing enough dynamic range and 
better circuit performance. 
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Figure 4.1 An equivalent circuit to illustrate the noise sources in a MOSFET. i G 2 , i s 2 , and / D 2 
are the noise contributions by the terminal resistances at the gate, at the source, and at the drain; 
Id 2 is the noise contribution in the channel, including the flicker noise portion; /'db 2 . *'sb 2 , and (dsb 2 
are the noise contributions by substrate resistances; and i g 2 is the induced gate noise 

4.2.1 The Physical Mechanisms of Flicker Noise 

Noise at low frequencies in a MOSFET is dominated by flicker noise. Measurements gen- 
erally show a spectral density of the input (gate) referred voltage noise, which is roughly 
inversely proportional to frequency, as shown in Figure 4.2. Therefore, flicker noise is also 
called 1 // noise. Much effort has been made in understanding the physical origin of flicker 
noise. However, the physical mechanism is still not very clear so far. A lot of discussions 
and investigations are continuing to find a universal model to explain the experimental 
results reported by different research groups that use devices from different manufacturers. 




Figure 4.2 Drain current noise spectral density of an n -channel MOSFET. Reproduced from Hung 
et al. ( 1990b) A physical-based MOSFET noise model for circuit simulation, IEEE Trans. Electron 
Devices, 37, 1323-1333 




FLICKER NOISE MODELING 121 



Although there are probably several different physical mechanisms resulting in noise in 
MOSFETs, there are strong indications that traps at the Si-SiCV interface play the most 
important role, as discussed by Jindal and Ziel (1978). Electron trapping and de-trapping 
can lead to conductance variations. The exact mechanism is still under discussion; however, 
basically, there are three different theories on the mechanism of flicker noise as follows: 

1. Carrier-density fluctuation models (number fluctuations), predicting an input referred 
noise density independent of the gate bias voltage and proportional to the square of 
the oxide thickness; 

2. Mobility fluctuation models, predicting an input referred noise voltage increasing with 
gate bias voltage and proportional to oxide thickness; 

3. Correlated carrier and mobility fluctuation models, a unified model proposed by Hung 
et al. (1990a) with a functional form resembling the number fluctuation model at low 
bias and the mobility fluctuation model at high bias. 

In the carrier density fluctuation model, the noise is explained by the fluctuation of 
channel-free carriers due to the random capture and emission of carriers by interface traps 
at the Si-SiCV interface. According to this model, the input noise is independent of the 
gate bias, and the magnitude of the noise spectrum is proportional to the density of the 
interface traps. A 1 // noise spectrum is predicted if the trap density is uniform in the oxide. 
Measurements of devices from many different CMOS processes with oxide thickness 
between 10 and 80 nm suggest that nMOS transistors behave as predicted by the number 
fluctuation model (see, for example, Vandamme (1994)). However, noise measurements 
of newer deep submicron transistors present a much less consistent picture. For instance, 
hMOS transistors also may show bias dependence, while p M OS transistors may have a 
noise corner frequency comparable to «MOS transistors. Also, the experimental results 
show a 1//" spectrum and n is not always 1 but in the range of 0.7 to 1.2. Some 
experimental results even show that n decreases with increasing gate bias in /^-channel 
MOSFETs. Modified charge density fluctuation theories have been proposed to explain 
these experimental results. The spatial distribution of the active traps in the oxide is 
assumed to be nonuniform to explain the technology and the gate-bias dependence of n. 

The mobility fluctuation model considers flicker noise to be the result of fluctuations 
in carrier mobility based on Hooge’s empirical relation for the spectral density of the 
flicker noise in a homogeneous device. It has been proposed that the fluctuations of the 
bulk mobility in MOSFETs are introduced by changes in the phonon population. The 
mobility fluctuation models predict a gate bias -dependent noise. However, they cannot 
always account for the magnitude of the noise. 

The unified theory for the origin of the 1 If noise suggests that the capture and emis- 
sion of carriers by the interface traps cause fluctuation in both the carrier number and 
the mobility. All unified noise models assume implicitly that the mobility, limited by 
Coulomb scattering at trapped interface charges, does not depend on the inversion car- 
rier density. However, recent experimental results indicate that the mobility, limited by 
Coulomb scattering, is proportional to the square root of the inversion carrier density 
(see Vandamme et al. (2000); Vandamme (1994)). Recently, some arguments even claim 
that the correlated mobility fluctuations can be neglected compared to the noise con- 
tribution from carrier number fluctuations, if the correct dependence of the Coulomb 
scattering -limited mobility on inversion carrier density is taken into account. As a result. 
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the unified noise models cannot predict the experimentally observed noise as a function of 
gate bias in /My pc MOSFETs unless nonphysical fitting parameters are used (Vandamme 
et al. (2000)). Nevertheless, even though this unified theory cannot explain all the details 
of the experimental data, it seems to be the most attractive model available today in 
circuit simulators. 



4.2.2 Flicker Noise Models 



It is for historic reasons that different flicker noise models have been developed on 
the basis of the three different approaches discussed in the preceding text. They are 
implemented in different simulators such as HSPICE, SPECTRE, ELDO, PSPICE, and 
so on. Almost all of the commercial simulators provide different options for users to 
select different noise models in noise simulation together with a specific compact model, 
such as MOS 9, EKV, and BSIM3v3, for simulations such as DC, AC, small signal, or 
transient analysis. For example, HSPICE includes three different models for the drain 
current flicker noise that are distinguished with different model levels (0-3). 

For NLEV = 0: 



Sid = 



KfIds Af 

Cox^eff 2 / 



(4.1) 



where Sa is the drain current noise power spectral density, /ds is the drain current, 
C ox is the unit-area gate oxide capacitance, L e ff is the effective channel length, / is the 
frequency, K F and A F are the fitting parameters. 

For NLEV = 1 : 



Sid = 



K f Ids Af 

C m L eS W eS f 



(4.2) 



where VT e ff is the effective channel width. 
For NLEV = 2 and 3: 

S id = 



k f g„ 



C ox L eS W eS f A r 



(4.3) 



where G m is the transconductance of the device and A F is a fitting parameter. 

In fact, some compact models have their own flicker noise models. For example, 
BSIM3v3 introduces two flicker noise models (Cheng et al. (1997)). One is the SPICE2 
flicker noise model (Vladimirescu (1994)), while the other is the unified flicker noise 
model. The latter is a newer model developed recently and has been considered a more 
accurate model than the SPICE2 flicker noise model (Hung et al. 1990b). The reason the 
SPICE2 flicker noise model is included in BSIM3v3 is to provide the convenience to 
some BSIM3v3 users who were familiar with the SPICE2 flicker noise model before the 
unified BSIM3 noise model was developed and who want to continue using it in noise 
simulation (Cheng and Hu (1999)). 

The SPICE2 flicker noise model is 



K F Ids A 

~ CoxTeff 2 /^ 



where E F is a fitting parameter. 
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The unified flicker noise model in BSIM3v3 is more complex. Basically, it includes 
a portion equivalent to the SPICE2 flicker noise model given by Eq. (4.4), but contains 
another portion to give a more accurate description of the flicker noise characteristics in 
the saturation region (Cheng and Hu (1999)). 

Currently, it is a fact that many different noise models are included in circuit simula- 
tors. However, it has to be pointed out that these models in commercial simulators are 
not fully compatible with each other. For example, the geometry dependence between 
Eqs. (4.1) and (4.3) are different, and the bias dependence between them is also different. 
Furthermore, those flicker noise models contain different oxide thickness dependencies. 
Modeling engineers and circuit designers need to be aware of this when performing noise 
simulation. A lot of work has been done to verify the accuracy of the flicker noise models 
over various bias conditions (see, for example, Vandamme et al. (2000)), but further work 
is still needed to develop a better flicker noise model that can explain most (if not all) of the 
experiments. So a careful selection of the flicker noise model is required to make sure that 
the model will predict reasonable noise performance according to the circuit applications. 

4.2.3 Future Work in Flicker Noise Modeling 

4.2.3. 1 Flicker noise modeling with the consideration of new physical 
mechanisms in MOSFETs with ultrathin oxides 

The above physical mechanisms of flicker noise are the ones we have frequently encoun- 
tered in literature. However, as the technology enters more advanced stages, new noise 
mechanisms may appear and play an important role. For example, it has been reported 
that the influence of a new mechanism on flicker noise performance should be accounted 
for in ultrathin oxide MOS transistors (e.g., 1.5 nrn or less) owing to direct tunneling 
currents that will alter the characteristics of the Ilf noise, depending on the length of the 
channel and the thickness of the gate oxide, as shown in Figures 4.3 and 4.4. 




Figure 4.3 Gate oxide thickness dependence of flicker noise in /(-channel MOSFETs with 0. 15-|xm 
and 0.2-|xm gate channel lengths. Reproduced from Momose H. S. et al. (1998) A study of flicker 
noise in n- and p-MOSFETs with ultra-thin gate oxide in the direct-tunneling regime. Tech. Dig. 
Int. Electron Device Meeting , 923-926 
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Figure 4.4 Gate length dependence of flicker noise in n -channel MOSFETs with various gate 
oxide thicknesses. Reproduced from Momose H. S. etal. (1998) A study of flicker noise in n- 
and p-MOSFETs with ultra-thin gate oxide in the direct-tunneling regime. Tech. Dig. Int. Electron 
Device Meeting, 923-926 

In Figures 4.3 and 4.4, the gate length and the oxide thickness dependence of gate 
referred voltage noise are shown at 1-kHz operation. Figure 4.3 shows the gate oxide 
thickness dependence of the gate referred voltage noise in devices with 0.15-p.m and 
0.2-p,m channel lengths. For the devices with gate lengths less than 0.2 p,m, the flicker 
noise in a device with 1.5-nm gate oxide thickness is lower than that in devices with 
thicker gate oxides. It means that the noise characteristics of devices have been improved 
with decreasing gate oxide thickness for the devices with such short channel lengths, 
although the gate leakage current becomes larger in the former. A possible mechanism 
for the lowering of flicker noise in the devices with thinner oxides is the appearance of 
band-to-band tunneling. However, as also shown in Figure 4.5, for devices with channel 
length longer than 0.2 |un, the flicker noise in the device with 1.5-nm gate oxide is higher 
than that in the device with thicker oxide (2.2 nm). An understanding of this result has led 
to the theory that the higher flicker noise in such devices with longer (than 0.2 |rm) channel 
length and thinner ( 1 .5 nm) gate oxide was caused by the much larger gate leakage current 
as the devices with longer channel lengths have larger gate area. Further theoretical and 
experimental investigations on this issue are needed to fully understand the contribution of 
the band-to-band tunneling and gate leakage to the flicker noise characteristic in today’s 
devices. A compact flicker noise model with the consideration of band-to-band tunneling 
and gate leakage has not been reported so far. 



4.23.2 Modeling and simulation of flicker noise under switched 
bias conditions 

It has been reported that devices under switched bias conditions show lower flicker noise 
than those measured at DC bias conditions (Wei et al. (2000) and Klumperink et al. 
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Frequency (Hz) 

Figure 4.5 Noise reduction as a function of the “off’ voltage for an «MOS, Vos, on = 2.5 V, 
Vth = 1.9 V, / sw i tC h = 10 kHz, duty cycle = 50%. Reproduced from Klumperink E. A. M. et al. 
(2000) Reducing MOSFET 1/f noise and power consumption by switched biasing, IEEE J. 
Solid-State Circuits, 35(7), 994-999 



(2000)). Figure 4.5 shows a typical measurement result. The noise spectrum between 
10 Hz and 100 kHz is shown for constant biasing (no switching) together with noise spectra 
resulting from a 10-kHz switched bias signal with 50% duty cycle. For 50% duty cycle, a 
low-frequency noise power that is reduced by 6 dB compared to the constant-bias situation 
is expected. Further noise reduction is observed when the gate-source voltage in the 
“off’ state is decreased, indicating an increasing noise reduction closer to accumulation. 
Figure 4.6 shows the results at various switching frequencies. All noise spectra appear to 
“merge” at low frequencies, with about 7 dB of intrinsic noise reduction (apart from the 
6 dB related to 50% duty cycle). Even at megahertz frequencies, where the settling of the 
output voltages becomes incomplete, this noise reduction is found. 

As switched biasing has been proposed as a technique for reducing the flicker noise 
in MOSFET’ s with reduced power consumption to benefit HF circuits (Klumperink et al. 
2000), it becomes essential for RF MOSFET models to give a reasonable prediction of 
flicker noise performance of the device under such conditions. In order to do that, the 
flicker noise model contained in the RF model must be continuous and accurate over 
a wide bias range from strong inversion to accumulation and from linear to saturation 
regimes. Further work is needed to validate the flicker noise models with measured noise 
data in devices under switch-biasing conditions and to develop more advanced noise 
models for RF applications. 
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Figure 4.6 Noise reduction while switching at different frequencies for an nMOSFET, 
Vos.on — 2.5 V, Vos.off = 0 V, duty cycle = 50%. Also shown is the noise floor under the same 
conditions. Reproduced from Klumperink E. A. M. et al. (2000) Reducing MOSFET 1// noise 
and power consumption by switched biasing, IEEE J. Solid-State Circuits , 35(7), 994-999 



4.3 THERMAL NOISE MODELING 



4.3.1 Existing Thermal Noise Models 



At HF, although all the noise sources contribute to the total noise, the dominant contri- 
bution comes from the channel thermal noise. The channel thermal noise characteristics 
in MOSFETs operating in the strong inversion region have been studied for over two 
decades. The origin of this thermal noise has been found to be related to the random ther- 
mal motion of carriers in the channel of the device. Various models have been developed 
and some of them have been implemented in circuit simulators. 

A simple thermal noise model has been implemented in circuit simulators since SPICE2 
was developed. 



Sid = 



8/u' TG, r 



(4.5) 



where £ B is the Boltzman constant and T is the absolute temperature in Kelvin; G m is 
the transconductance of the device. 

Other similar models have also been proposed as given in the following: 



Sn — 



Sid = 



8AbTGds 
3 ' 

$knT(G m 



+ Gds) 



(4.6) 



3 



(4.7) 




THERMAL NOISE MODELING 127 



Sid = 



8^b T(G m + G ds + G m b) 



(4.8) 



where Gds and G m b are the channel conductance and bulk transconductance. 

Most compact models developed for circuit simulation have their own thermal noise 
models. For example, BSIM3v3 includes the following equation to calculate the thermal 
noise of the device as a user option besides the one given by Eq. (4.8): 



S id = 



4 k B T /r e ff 

L e ft 2 



Q inv 



(4-9) 



where /r C ff is the effective carrier mobility, L t{{ is the channel length of the device, and 
<2j nv is the total inversion charge in the channel. 

It has been reported that Eq. (4.5) gives a nonphysical prediction of thermal noise at 
F ds = 0 V (see, for example, Want et al. (1994)). Equations (4.7) and (4.8) are proposed 
to fix this problem even though their accuracy and physical basis need to be verified. 
Studies to validate the accuracy of the above noise models have been reported recently 
(see, for example, Chen et al. (2000)). Some discussion will be given later. 

Another thermal noise model that is not implemented in all commercial circuit simu- 
lators but widely used for noise analysis by circuit designers is 



Sind = 4£fi ^Gnch, 
= 4 k s T yG m 



(4.10) 



where G nc h is the channel thermal noise conductance and y is a bias-dependent factor, 
which for long-channel devices is equal to unity in the linear region and to 2/3 in the 
saturation region. 

The /-factor has been used as a figure of merit to compare the thermal noise perfor- 
mance of different devices. It shows how much noise is generated by the device at the 
input for a given transconductance. It has been found that the /-factor is not a constant 
for devices with different channel lengths and the /-factor for short-channel device can 
be larger than that for long-channel device in the saturation regime owing to both velocity 
saturation and hot electrons. Some models have been proposed to account for the velocity 
saturation effect and hot carrier effects (for example, Klein (1998), Scholten et al. (1999), 
and Knoblinger (2000)), but they have not been implemented in any commercial circuit 
simulators yet. Recently, a simple thermal noise model has been proposed to account for 
both velocity saturation and hot carriers and can be easily implemented. This noise model 
was originally developed for a transistor biased in saturation and in strong inversion, 
but an extended expression has been proposed to cover the regions from weak to strong 
inversion by rewriting the noise parameter as (Enz and Cheng (2000)) 



Y = Yl 




1 CatT \ 



(4.11) 



where v sat is the saturation velocity, r r is a relaxation time (of the order of ps) used 
as a fitting parameter, G is the normalized G,„//d ratio, and / L is the /-factor for the 
long-channel device. 
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This simple model assumes that the carrier velocity is saturated and that the lateral 
held is equal to the critical held all along the channel from source to drain. Although 
these assumptions are questionable, the resulting model can ht the measured data over 
bias and geometry. 



4.3.2 HF Noise Parameters 

In noise model derivation and circuit simulation, the noise PSD is used as a measure for 
the noise output in the device. Circuit designers also prefer to use the noise PSD, the 
parameters related to the noisy two-port equivalent circuit. However, in measurements, the 
HF noise is usually characterized by several other parameters: the minimum noise factor 
(or minimum noise hgure), the input referred noise resistance, and the optimum source 
admittance for which the minimum noise hgure is obtained. Therefore, it is necessary to 
discuss these parameters to understand their physical meanings and their relationship in 
describing the HF noise characteristics of the device. 

The noise factor is a hgure of merit for the performance of a device or a circuit with 
respect to noise. The standard dehnition of the noise factor of a two-port network is the 
ratio of the available output noise power per unit bandwidth to the portion of that noise 
caused by the actual source connected to the input terminals of the device. It can be given 
by the following equivalent equation: 



Sj/Nj 

S o /N 0 



(4.12) 



where S, and S 0 are input and output signals and N\ and N 0 are input and output 
noise power. 

The noise factor can be expressed in decibel form, which is termed as the noise hgure, 
that is, 

N P — 10 log F. (4.13) 



The noise hgure of a two-port network is given by (see, for example, Pospieszalski 
(1986)) 

N F — N Fmin -\ \y s - y op tl 2 (4.14) 

gs 

where r„ is the equivalent normalized noise resistance of the two-port network, y s = g s + 
jb s is the normalized source admittance, and y opt = g op t + jb op t represents the normalized 
source admittance that results in the minimum (or optimum) noise hgure /V|.- min . 

The y s and _y opt can be expressed in terms of the reflection coefficients T, and T opt 
the ratio of the incident to the reflected wave along a transmission line 






i -r, 
i + r s 



(4.15) 



and 




1 + r opt 



Vopt — 



(4.16) 
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Figure 4.7 An example of measured -/V Fmin versus frequency for a MOSFET 



i 1 i 1 i 1 r 



i 1 r 



Vns = 1 V 



Finger numbers = 10 
Width per linger = 6 pm 
Length = 0.18 pm 



"I ' 1 ' 1 ' T 



1 ' T 



Thus, Eq. (4.14) becomes the following form 



N F = + 



4 Yn • IT, - T opt | 2 

(i - |r,| 2 ) ■ |i + r opt | 2 ' 



(4.17) 



In the HF noise measurements, the source reflection coefficient is varied until a min- 
imum noise figure is reached. The value of A Fmin , which occurs when T, = r opt , is read 
from the noise figure meter, and the source reflection coefficient that produces AT min is 
determined by a network analyzer. The noise resistance r n is measured by reading the 
noise figure when T, = 0. 

/V[7 min is a function of the biases (operating current) and frequency. Each AT min is asso- 
ciated with one value of T opt Figure 4.7 shows a typical measured characteristic of lV Fmin 
versus frequency for an RF MOSFET. Figure 4.8 gives a typical measured plot of lV Fmin 
versus bias current. According to the measured noise characteristics, MOSFET can pro- 
vide a low noise figure that is attractive to the RF applications. Also, a careful selection 
of the bias conditions is important for the device to achieve a lowest noise performance 
as shown in Figure 4.8. Since an RF MOSFET with a very short channel length includes 
many different physical effects and contains nonnegligible parasitics, it is not very easy to 
optimize the noise performance of the devices in a circuit with hand calculation by using 
analytical equations. So it is desirable that an RF model with accurate noise prediction 
be developed for use in circuit simulation. Whether an RF model can accurately predict 
the characteristics of noise figure versus bias currents for devices with different sizes is 
another challenge for device model developers. 

As mentioned above, circuit designers prefer to use the parameters related to a two- 
port network to describe the noise performance of a device and a circuit. Universal noise 
models have been developed for any two-port network. A noisy two-port network shown 
in Figure 4.9(a) can be represented by a noise-free two-port network with two noise 
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Figure 4.8 An example of measured /V|. imn versus / DS for a MOSFET 
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Figure 4.9 Noisy two-port and its ABCD-parameter representation 
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current sources, one at the input port (i \ ) and the other at the output port (i 2 ) as shown in 
Figure 4.9(b). Figure 4.9(b) can also be transformed into a noise-free two-port network, 
shown in Figure 4.9(c), with a noise current source, 5),, = 4k B TG m , and a noise voltage 
source, 5),, = 4k B TR m , at the input port, where the i>„ and /„ are correlated to each other 
and the correlation relationship is described by a correlation admittance Y c = G c +JB C . 
The noise source i n can be further separated into a noise source i nu that is uncorrelated 
to u n and a noise source i nc that is fully correlated to v„, 

in — inu 4“ inc (4.18) 

and 

inc = Vcv„. (4.19) 

The above relationship can be expressed in terms of noise PSD as follows: 

Sin = s inu + S inc , (4.20) 

Si„c = |k c | 2 S vn . (4.21) 

According to the two-port network given in Figure 4.9, we further have the following 
relationships: 



1 



\7 ^*2 ’ 


(4.22) 


*21 




h + kiiDn, 


(4.23) 


Yn~Y 2l ^P = G c + jB c , 

^i 2 


(4.24) 


12 o = 4k B TR m , 

|y 2 i| 2 


(4.25) 


4k B TG in . 


(4.26) 



On the basis of the above relationships, the four noise parameters discussed earlier can 
be calculated, 



Rn = 


Rva, 


(4.27) 


^opt = 




(4.28) 


Bopt = 


- B c , 


(4.29) 


N-p . = 

1 min 


1 + 2i? n (G c + Gopt). 


(4.30) 



Similarly, noise parameters related to the two-port network can be calculated once we 
have the four noise parameters, R a , G opt , B 0 pt, and AT min , 



R va — R n , 



(4.31) 
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(4.32) 

(4.33) 



B C ^opt 5 

Gin = (G opt 2 + B c 2 )R n 

Nf — \ 

G c - G^. (4.34) 

z/v n 

To this point, we have established a conversion relationship between the four noise 
parameters obtained from the measurements and the noise parameters related to the two- 
port network for circuit analysis. Detailed analysis can be further performed for the noise 
performance of the device and a circuit based on the above derivations. 

According to the noisy two-port network theory, a useful equation for the PSD of i 2 
can be obtained as follows (Gonzalez (1997)): 

S i2 = S va \Y2i | 2 = 4k B TR n \Y 2l \ 2 (4.35) 

where k B is the Boltzmann’s constant, T is the absolute temperature, Y 2 \ is the transad- 
mittance from port 1 to port 2 of the noise-free two port, and R n is the equivalent noise 
resistance, which is a resistance cascaded at the input port that will produce the same 
amount of noise PSD as i 2 does at the output port. 



4.3.3 Analytical Calculation of the Noise Parameters 



Figure 4.1 illustrates all noise sources in a MOSFET. However, it is too complex to be 
used to calculate the contribution of each noise source analytically. A simplified equivalent 
circuit (EC) shown in Figure 4.10 can be obtained by neglecting some components in 
Figure 4.1. 

In Figure 4.10, the capacitances Cbs and Cbd have been neglected and the influence 
of the different substrate resistance components have been taken care of by R sub . On the 
basis of the equivalent circuit, the following noise parameters are obtained: 



R - — # 

m 

Gin — 7. S atG m 0“l//' , 



G c = 



R a (G m e ) 2 t 



(4.36) 

(4.37) 

(4.38) 




Figure 4.10 Simplified small-signal schematic for noise calculation 
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B c 



G m 0x 

ft 



ft — 1 + oig + o? SU b + ( G,„Rg9 ) Vl 



0 = 2 nf 



C G s 



(4.39) 

(4.40) 

(4.41) 



where /. sat is a parameter proportional to y -factor discussed earlier, parameters t// and 
X account for the induced gate noise and its correlation to the drain noise, a„ is the 
ratio of the noise PSD of the gate resistance to the input referred channel noise, and a sub 
is the ratio of the output referred substrate resistance noise PSD to the output referred 
channel noise, 



= 



^sub — 



G m Rg 
^-sat 

2 p 

^ mb /v sub 



Parameters ij/ and x are given in the following: 



i' — 1 + 0! sub + 1- 2c 

Xsat 




X = 



1 -f Q! sub C. 




(4.42) 

(4.43) 



(4.44) 

(4.45) 



where er sat is a bias- and geometry-dependent factor, c g is a fitting parameter that is 0.395 
for a long-channel device. 

Both i/f and / reduce to 1 + Q! sub when the induced gate noise is ignored (er sat = 0). 
The induced gate noise contributes mainly to G m through the factor i J/, the gate resistance 
R(, contributes to R vn , and the channel noise and substrate noise contribute to both G in 
and R vn . Substrate noise may typically contribute to 20% of R vn , whereas R a typically 
contributes to about 5% of R va . It is therefore important to account for the substrate 
resistance when doing noise calculation and noise optimization. 

The noise parameters of an n -channel device have been measured and carefully de- 
embedded using the methodology presented in some literatures (see, for example, Aufinger 
(1996)). They are presented in Figure 4.1 1 and compared to the results obtained from sim- 
ulation using the complete subcircuit of Figure 4. 1 with the additional induced gate noise 
source added to the subcircuit (but not accounting for the correlation between induced gate 
noise and drain thermal noise). The results obtained from Eqs. (4.36) to (4.45) includ- 
ing the correlation between induced gate noise and channel drain noise are also shown 
in the figures. The meaning of the symbols (si, s2, s3, s4) shown in Figure 4.11 are 
defined as follows, as discussed by Enz and Cheng (2000). si enables the gate resistance 
noise, s2 enables the substrate resistance noise, s3 enables the induced gate noise with- 
out correlation (c g = 0), and s4 enables the induced gate noise correlation (c g = 0.395), 
respectively, in the simulations. Figure 4.11 shows that the gate and substrate resistances 
strongly affect the minimum noise figure AT„, in , the optimum noise conductance G op t, and 
the input referred noise resistance R v „. The induced gate noise slightly affects AC rnin and 
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G 0 pt , but has no effect on R vn . From these results, it can be concluded that induced gate 
noise is not the only contributor to the minimum noise hgure and that the gate resistance 
and, more importantly, the substrate resistance also contribute signihcantly. 

Note that the analytical expressions for the noise parameters give reasonable results 
below fj/5 and the discrepancies appearing at HF between the analytical and the measured 
results mainly come from a wrong frequency behavior due to the very simple equivalent 
circuit used for the derivations of Eqs. (4.36) to (4.45). 



4.3.4 Simulation and Discussions 

With the extracted parameters from the measured data for a 0.25-p.m RF CMOS tech- 
nology, the noise characteristics of the subcircuit model discussed above are verified. 
The four noise parameters calculated with the simulated noise characteristics are given 
in Figure 4.12 against the measured data for a 0.36-p.m device at different bias con- 
ditions. While the RF model with extracted parameters fits accurately the measured 
i-parameters data, it can also predict well the HF noise characteristics of the device 
as shown in Figures 4.12 to 4.15 by the curves at Vgs = 1 V and Vds = 1 V. The dis- 
crepancy in the R n characteristics between the model and the measured data at V GS = 2 V 
needs further investigation. However, obvious disagreement in the simulated and mea- 
sured imaginary part of Yu has been found at that bias condition, so the discrepancy in 




Figure 4.11 Comparison between measured and simulated noise parameters (Ww R n , 
G op t, 5 0 pt)- Reproduced from Enz C. and Cheng Y. (2000) MOS transistor modeling for RF IC 
design, IEEE Journal of Solid-State Circuits, 35(2). 186-201 
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Figure 4.12 Comparisons of measured data for minimum noise figure, /V Fmin , with simulations 
at different bias conditions. The Nf is the finger number of each device. The channel width per 
finger is 10 p,m and the channel length is 0.36 p,m. Reproduced from Cheng Y. et al. (2002) High 
frequency small signal AC and noise modeling of MOSFETs for RF IC design, IEEE Trans. On 
Electron Devices, 49(3), 400-408 




Figure 4.13 Comparisons of measured data for the magnitude of the optimized source reflection 
coefficient r op t, with simulations at different bias conditions. Reproduced from Cheng Y. et al. 
(2002) High frequency small signal AC and noise modeling of MOSFETs for RF IC design, IEEE 
Trans. On Electron Devices, 49(3), 400-408 



the R n characteristics may be caused by the inaccuracy of the capacitance model in that 
operation regime. 

The noise characteristics of several noise models including the subcircuit RF model 
above are also verified with the extracted channel thermal noise with the methodology 
discussed by Chen (2001). Figure 4.16 shows the curves of the channel thermal noise 






136 NOISE MODELING 




Figure 4.14 Comparisons of measured data for the phase of the optimized source reflection coef- 
ficient, r opt , with simulations at different bias conditions. Reproduced from Cheng Y. et al. (2002) 
High frequency small signal AC and noise modeling of MOSFETs for RF IC design, IEEE Trans. 
On Electron Devices , 49(3), 400-408 




Figure 4.15 Comparisons of measured data for the noise resistance normalized to 50 f2, r n , with 
simulations at different bias conditions. Reproduced from Cheng Y. et al. (2002) High frequency 
small signal AC and noise modeling of MOSFETs for RF IC design, IEEE Trans. On Electron 
Devices, 49(3), 400-408 

versus bias current, from the measured data, and simulations of BSIM3v3 noise model 
(Noimod = 4) and several other noise models given by Eqs. (4.6) to (4.8). It shows 
that the calculated channel thermal noise based on Eqs. (4.6) to (4.8) cannot predict 
the channel thermal noise extracted from measured data. The subcircuit RF model with 
the BSIM3v3 noise model (Noimod = 4) has much better accuracy at several different 
bias conditions. 
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Figure 4.16 Power spectral densities of channel thermal noise versus bias current of a 0.36-p.m 
n-channel MOSFET. They are extracted from the measured data and calculated from different 
channel thermal noise models. Reproduced from Cheng Y. et al. (2002) High frequency small 
signal AC and noise modeling of MOSFETs for RF IC design, IEEE Trans. On Electron Devices, 
49(3), 400-408 
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Figure 4.17 Illustration of induced gate noise and the equivalent circuit model 
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4.3.5 Induced Gate Noise Issue 

The concept of the induced gate noise has been used for three decades (see, for example, 
Halladay and Ziel (1969)). But many researchers are still studying how to model it 
correctly (Triantis (1997)). At high frequencies, it is believed that the local channel voltage 
fluctuations due to thermal noise couple to the gate through the oxide capacitance and 
cause an induced gate noise current to flow. This noise current can be modeled by a 
noisy current source connected in parallel to the intrinsic gate-to-source capacitance C G si 
as shown in Figure 4.17. Since the physical origin of the induced gate noise is the same 
as that for the channel thermal noise at the drain, the two noise sources are partially 
correlated with a correlation factor c g . 

The power spectral density of the induced gate noise PSD is given by 



•Sing = 4k B TG ng 



and 



_ cr S at(27r/C G s) 2 
n 



(4.46) 

(4.47) 



Device noise simulations performed for a finger length have shown that the correlation 
factor remains mainly imaginary (real part about 10 times smaller than the imaginary 
part) and that its value is slightly smaller than the long-channel value 0.395 (it typically 
ranges from 0.35 to 0.3 for short-channel devices). 

Currently, the induced gate noise and its correlation to the thermal noise at the drain 
are not yet implemented completely in compact models. One reason is the difficulty of 
modeling the induced gate noise and implementing it in circuit simulators. Another reason 
is that it is not very critical at frequencies much smaller than the f T of the device, since at 
that frequency range two more important contributors to the total noise are the substrate 
and the gate resistances, instead of the induced gate noise, besides the channel thermal 
noise. A methodology to extract the induced gate noise has also been developed. However, 
further detailed investigations are needed to understand the induced gate noise issue and 
model it correctly. The subcircuit RF model discussed in this paper does not include the 
contribution of the induced gate noise. 
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5 

Proper Modeling for Accurate 
Distortion Analysis 



5.1 INTRODUCTION 

To minimize the influence of noise in analog and RF integrated circuits, signal amplitudes 
are made large. This will cause high distortion effects caused by the inherent nonlinearity 
of the devices used to implement the circuit functions. The literature on distortion analysis 
is mainly focusing on operational amplifiers (OPAMPS), RF amplifiers, transconduc- 
tors (voltage-to-current converters), and more recently track-and-hold amplifiers (see, for 
example, Shoucair and Patterson (1993); Mensink et al. (1996); Bruun (1998); Wambacq 
et al. (1999); Chilakapati et al. (2002); Flernes (2003); Limotyrakis et al. (2002)). 

All devices available to designers of analog CMOS circuits are more or less nonlinear 
and hence can cause circuit functions to degrade owing to distortion effects. Especially 
MOS transistors, which are the most accurate and effective devices for implementing 
analog functions in CMOS technology, are prone to exhibit nonlinear characteristics. 
With the continuous downscaling of feature sizes, the transistor characteristics are more 
and more augmented by second-order effects such as mobility degradation due to high 
vertical fields, velocity saturation in the channel, and other nonideal effects that were 
discussed in Section 1.5.2. All these effects will introduce additional nonlinearities (see, 
for example, van Langevelde and Klaassen (1997a, 1997b); Pu and Tsividis (1990)). 

The characterization of MOS transistors that is provided by the foundries is commonly 
based on fitting the modeled drain current characteristics and the small-signal capacitors 
to measured data. This is so-called digital characterization. In some cases, the foundries 
also include fitting of small-signal parameters such as transconductance g m and channel 
conductance gds to satisfy the needs of analog designers. The main objective of this chapter 
is to illustrate that even accurately reproducing g m and gds is not sufficient for proper 
estimation of distortion and intermodulation effects. For example, the third-order harmonic 
distortion component depends on the third-order derivative of the device characteristics. 

This chapter is organized as follows. We start by defining basic terminology commonly 
used in analysis of distortion effects. Then in Section 5.3, we discuss the nonlinear- 
ity of the MOS devices using the terminology just described and point out important 
requirements of the models to be able to predict the effects of the nonlinearities. Finally, 



Device Modeling for Analog and RF CMOS Circuit Design. T. Ytterdal, Y. Cheng and T. A. Fjeldly 
© 2003 John Wiley & Sons, Ltd ISBN: 0-471-49869-6 





142 PROPER MODELING FOR ACCURATE DISTORTION ANALYSIS 



in Section 5.4, we briefly discuss important methods commonly employed to calculate 
distortion in analog CMOS circuits. 



5.2 BASIC TERMINOLOGY 

For analog circuits, the linearity characteristics are often described in terms of parameters 
that are measured in the frequency domain for circuit excitations being one or more 
sinusoidal sources. In single sinusoidal excitation the input signal is of the form A cos(cot), 
where A and co are the amplitude and the frequency of the input signal, respectively. This 
situation is referred to as single-frequency excitation. 

When the amplitude of the input signal A is small enough, the output spectrum of 
the circuit under test contains only one frequency component above the noise floor. This 
component is located at the same frequency as the input signal, that is, at <x> and is called 
the fundamental frequency. The amplitude of the fundamental frequency signal changes 
proportionally with the input amplitude. As the input signal amplitude is increased, the 
output spectrum also contains components at the frequencies 2 co and 3 co. These signals, 
called the second and third harmonics, originate from second- and third-order nonlin- 
ear circuit behavior, respectively, as we will see later. Harmonics higher than the third 
come above the noise floor at even higher values of the input amplitude. We will also 
learn later that the amplitude of the /7 th harmonic increases as the z/th power of the 
input amplitude. 

Usually, the linear response is the wanted response and it is therefore common to 
denote the harmonics as distortion, nonlinear distortion, or harmonic distortion. 

To define and illustrate the different quantities used in distortion analysis, we consider 
a nonlinear system with an input signal x(t) and an output signal >>(/). The output signal 
can be written as a Taylor expansion of the input signal as follows. 

y ft) — a\x(t) + a2x 2 (t) + a3x 3 (t) + ■ ■ ■ . (5.1) 



Here, the coefficient a 1 describes the behavior of the linearized circuit. The higher-order 
coefficients a2, a3 , ... are called the higher-order nonlinearity coefficients. 

The total harmonic distortion ( THD ) of a signal is defined to be the ratio of the total 
power of the second- and the higher-order harmonics to the power of the fundamental 
component of that signal. There exist several mathematical expressions for this definition. 
One commonly used approach is to express THD in decibels: 



THD = 10 log 




dB. 



(5.2) 



Usually, the second- and/or the third-order distortion components contribute the most 
to THD and we will denote these as HD 2 and HD^, respectively. They are defined as 
the ratio of the of the second and third harmonic, respectively, to the amplitude of the 
fundamental response: 



HD 2 



! a ]o2| 

2 \al\’ 



(5-3) 
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HD 3 = -A 



2 |o3| 
4 lall' 



(5.4) 



These distortion components are also often referred to as harmonic amplitudes. 

If the output signal y(t ) is a known analytic function y(t) — /(.*(?)), then the coeffi- 
cients a 1, a 2, . . . can be written in terms of the Taylor series coefficients. The first three 
coefficients are then 



II 


(5.5) 


, Id 2 / 
" 2= 2dtZ' 


(5.6) 


, 1 d 3 / 

a 3 = 

6 dx 3 


(5.7) 



In low-pass and band-pass systems, the higher-order harmonics of single-frequency exci- 
tation often fall outside the system bandwidth and hence, are not possible to measure. 
To be able to characterize such systems, one usually applies an input signal containing 
two sinusoids at frequencies cc>\ and a >2 that are close to each other. The nice feature 
of the output spectrum of such a test applied to a nonlinear system is that it contains 
components at the sum and differences of the two frequencies. Hence, it is easy to define 
the two frequencies such that at least one of the components falls inside the system band- 
width. The signals at \a>\ ± 6j 2 | are caused by second-order nonlinear behavior and are 
called second-order intermodulation products. The output spectrum contains even higher- 
order intermodulation products at frequencies \2a>i ± a> 2 | , |o>i ± 2a>2 | , and so on. When 
characterization of intermodulation is performed, second-order intermodulation distortion 
(IMt) and third-order intermodulation distortion (IMt,) are usually measured. IM 2 and /M 3 
are defined as the second- and third-order intermodulation products to the fundamental 
response, respectively. 

Semiconductor devices are usually modeled in terms of equivalent circuits containing 
different basic circuit elements such as resistors, capacitors, and controlled current and 
voltage sources. The nonlinearity of these devices can be categorized as follows (Wambacq 
and Sansen 1998) and is referred to as basic nonlinearities: 

• Nonlinear conductance The current through the element is a nonlinear function of 
the voltage across the element. 

• Nonlinear transconductance The current through the element is a nonlinear function 
of a voltage somewhere else in the equivalent circuit. 

• Nonlinear resistance The voltage across the element is a nonlinear function of the 
current through it. 

• Nonlinear transresistance The voltage across the element is a nonlinear function of 
a current somewhere else in the equivalent circuit. 

• Multidimensional nonlinear conductance and transconductance The current through 
the element is a nonlinear function of more than one voltage. 

• Multidimensional nonlinear resistance and transresistance The voltage across the 
element is a nonlinear function of more than one current. 
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• Nonlinear capacitance The charge stored on this element is a nonlinear function of 
the voltage across the element. 

• Nonlinear transcapacitance The charge stored on this element is a nonlinear function 
of a voltage somewhere else in the equivalent circuit. 

As examples of the basic nonlinear elements listed above, we discuss the nonlinear con- 
ductance and the multidimensional transconductance below. 

The general Taylor series description of the AC output current of a nonlinear conduc- 
tance expanded around the DC operating point is given by 



t'out(f) = f(V + v(t)) - /(V) = 

k= 1 



1 d k f(x(t)) 
k\ dx k 



x=V 



V k (t). 



(5.8) 



Here, V is the DC operating point and v is the AC part of the voltage across the conduc- 
tance. Usually it is sufficient to keep only the first three terms of the sum in (5.8). The 
term corresponding to k = 1 is simply the linear conductance and is given by 



g = 



d fix) 
d.r 



— al g . 

x=V 



(5.9) 



The second- and third-order terms are labeled a 2 g and a 3 g , respectively, and can be 
written as 



a2 g 



a3 g 



1 d 1 f(x) 
2! djt 2 

1 d 3 fix) 
3! djc 3 



x=v 



x=V 



(5.10) 

(5.11) 



Now if we are interested in the third-order harmonic amplitude HD? g of the output current, 
it can be calculated from the definition given in (5.4) as follows: 



HD, g = 



1 , 2 l a3 gl 
4 1 ’ | al g \ ' 



(5.12) 



Hence, to accurately model HD?, of the nonlinear conductance, the third-order derivative 
of the conductance with respect to the voltage across it has to be accurately described as 
mentioned in the introduction section of this chapter. 

Now let us turn to the multidimensional transconductance. To illustrate the added 
complexity of this element, we choose a transconductance with two controlling voltages. 
The AC output current of this element can be written in terms of a Taylor expansion at 
the DC operating point U, V as follows: 



OO k 

iout(0 - f(U + u(t), V + v(t)) - f(U , V) = EE 

k= 1 /=0 



d k fix, y) u' v k ~‘ 
dx'dy k ~‘ i ! (k — i)\ 



(5.13) 
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If we keep only terms up to and including third order, (5.13) can be simplified to 



9/ , 1 9 2 / 



1 9 3 / 



9/ , 1 9 2 / 



19 3 / 



' />\ /-v, I J Z i ~ J j * J . ~ J Z i ~ J 

*out(0 ~ — U + yU +-——U +— V+-—TV +-—T V 



dx 



2 dx 2 



6 dx 3 



dy 2 dy 2 



6 dy 3 



,9/ 9/ , 1 9 2 / 9/ 2 , 1 9/ 9 2 / 2 

H uv H — u v -\ -uv 

dx dy 2 dx 2 dy 2 dx dy 2 

— al u u + a2 u ii 2 + a 3„« 3 + al„t> + a2 v v 2 + a3 v v 3 

+ a2 uv uv + a3 2uv u 2 v + a3 ll2v uv 2 . 



(5.14) 



Here we have added a subscript to the nonlinearity coefficients to indicate for which 
voltage the derivative has been taken. 



5.3 NONLINEARITIES IN CMOS DEVICES 
AND THEIR MODELING 

To be able to analyze the nonlinear behavior of integrated circuits, the nonlinearity of the 
devices involved must be known. All devices used in integrated CMOS circuits are more 
or less nonlinear. Here we will only discuss MOS transistors to illustrate the modeling 
requirements for accurate analysis of distortion effects since the same approach can be 
applied to other devices as well. 

Equivalent circuits for MOS transistors include the following types of nonlinear ele- 
ments: nonlinear conductance, multidimensional nonlinear transconductance, and nonlin- 
ear transcapacitance. 

Let us use the well-known square law model 1 for the drain current of MOS transistors 
and derive expressions for the AC part of the drain current. If we assume that the device 
is biased in the saturation region above threshold, the drain current is given by 



h = K(V gs - V t ) 2 (1 + AV ds ), 


(5.15) 


Vt = Vto + Y (V <t> — V bs — > 


(5.16) 


1 W 




K ~ 2 MCox T 


(5.17) 



Here, Vj is the threshold voltage, A is the channel-length modulation parameter, V/o is 
the threshold voltage at zero bulk-source voltage, y is the body effect parameter, / is 
the bulk Fermi potential, /i is the field effect mobility, C ox is the oxide capacitance, W 
is the gate width, and L is the gate length. From the model equations above we note 
that we have three voltages that constitutes the independent variables of our nonlinear 
model, Vg S , Vds. and V bs . Following the approach described in the previous section, we 



1 The square law model is the level 1 MOS transistor model in SPICE, which was first published in Shichman 

and Hodges (1968). 
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can expand (5.15) into a Taylor series that gives the following expression for the AC part 
of the drain current: 

id = g m v gs + a2 gm v 2 gs + a3 gm v^ + ■■■ 

+ gdsTds + a2 gda vj s + a3 gds vl + ■■■ 

+ gmbVbs + a2g mb Vbs + a 3 ft „ b t)bs H 

a 2gmgds V gS t’ds + a ^2g mgds V gs Vds + a 3 Sm 2g ds TgsTds + ' ' ' (5.18) 

+ a ~gmgmb v gs^bs + a ^2g m g mb V ss Vh & + a 3g m 2g mb Vg S Vbs + ■ ■ ■ 

Cl^ gd s g m h TdsTbs + a 32g ds g mb l>J s Tbs + a ^gd s 2g mb TdsTb s + ■ ■ ■ 

3” fl 3g m g ds g mb ng S l>dsTbs + ’ ’ ’ • 



In (5.18) we have used conductance symbols instead of voltages as subscripts to indicate 
for which voltage the derivative has been taken. The expressions for the nonlinearity 
coefficients up to and including third order are given by 



8m 



a 2 



8m 



a3 



gm 



8 ds 



a 2 



8ds 



a 3 



£ds 



8 mb 



a ^gmb 



a ^gmb 
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gmgds 



a ^2 g mgds 



dl d 
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1 9 2 /d 
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1 9 3 / d 



6 9V| 

9/ d 



3V ds 

1 9 2 4i 



= 2£(V gs - V T )(l+kVds), 
= K( 1 + XVds), 

= 0, 

= K(V g S - Vt) 2 A, 

= 0 , 

= 0 , 



2 ^d 2 s 

1 9 3 /d 
6 9^ds 

9/d 9/d 9 Vt //(Vg S - Vrld+A-Vds)/ 



9V bs 9y T 9V b s 



V0 ^ 



1 9 2 / d = = Ky(V gs -V T0 + y^)(l+XVds) 

2dV 2 8V bs 4(0 - v bs ) 3/2 

1 9 3 /d = 3Ky(V g& - Vjq + y V0)(1 + AU ds ) 

6 9 V b 3 s 8(0 - U bs ) 5/2 



9/ d 9/d 
9Vg S 9Vds 



- 2 K( V gs - V T )k, 



1 9 2 /d 3/d 

2 9 V g 2 s 9 Vds 



(5.19) 

(5.20) 

(5.21) 

(5.22) 
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(5.24) 

(5.25) 

(5.26) 

(5.27) 

(5.28) 
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KyX 



2 9 V ds 9 V bs (0 - V bs ) 3/2 ’ 
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9 Vg S 9 V ds 9 V bs V0 - Hs 



(5.30) 

(5.31) 

(5.32) 

(5.33) 

(5.34) 

(5.35) 

(5.36) 

(5.37) 



By examining the above equations, we notice that the AC current depends on the higher- 
order derivatives of the drain current, which was mentioned in the introduction section 
of this chapter. For example, the third-order harmonic amplitude related to the transcon- 
ductance contains the term 9 3 / d /(9 V gs ) 3 . Hence, to accurately estimate HD igm , the device 
model should accurately reproduce the third-order derivative of the drain current with 
respect to the gate-source voltage. Evidently, by looking at (5.21) we conclude that the 
MOSFET level 1 model does not perform well in predicting HD igm since a3 gm is identical 
to zero for this model. 

It is not only the quadratic model that has problems with reproducing higher-order 
derivatives. Also, modem state-of-the-art models like the MM9 model from Philips (see 
Chapter 8) and the different BSIM3 versions also struggle to provide accurate estimates 
of the higher-order nonlinearity coefficients. These problems are not only caused by the 
expressions used in the model but also by the parameter extraction procedures employed 
to generate the parameter sets for the model. A thorough and excellent discussion of 
the performance of commonly used MOSFET models in terms of accuracy in predicting 
distortion effects is given in Wambacq and Sansen (1998). As an example of poor esti- 
mation of higher-order derivative, we show in Figure 5.1 the simulated transconductance 
and the third-order derivative of the drain current with respect to the gate-source voltage 
a 3 gm versus the gate-source voltage for a 0.5-p.m-long MOS transistor having a drain- 
source voltage of 3.0 V using the BSIM3v2.0 model. The calculated a3 gm exhibits several 
discontinuities when the gate-source voltage is around 1 V. 

Several MOSFET models have been published that address the problems discussed 
above. These models provide enhanced accuracy in estimating the nonlinearity charac- 
teristics of modern deep submicron MOS transistors. Here, we would like to mention 
one such model that was published in van Langevelde (1998). In this work, the author 
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Figure 5.1 Simulated transconductance and a 3 gm of a 0.5-|xm-long MOS transistor having a 
drain-source voltage of 3.0 V using the BSIM3v2.0 model 



presented a MOSFET model with emphasis on accurately predicting distortion effects in 
deep submicron MOS transistors. The model is also suitable for implementation in circuit 
simulators such as SPICE. According to the author, the following requirements should be 
satisfied for such a model: 

• The model should include physics-based description of short-channel, high-held, and 
other effects that become important in deep submicron devices such as, mobility degra- 
dation, series resistance, velocity saturation, drain-induced barrier lowering, channel- 
length modulation, hot-carrier effects, and non-quasi-static effects. 

• The model should be symmetric around Vds = 0 in all quantities including higher-order 
derivatives. 

In addition to the points above, we would like to add that the parameter extraction 
procedure should be part of the model and developed in parallel with the development 
of the model equations. Parameters should be extracted on the basis of measurements of 
not only currents, g m and g ds , but also of higher-order derivatives. 

All nonlinearities discussed so far have been frequency-independent. Nonlinearities 
that arise from memory elements, such as capacitances, increase with frequency since 
the impedance of such elements decrease with frequency. In Figure 5.2 we show mea- 
sured HD 2 and HD 2 of a CMOS two-stage Miller-compensated OPAMP configured as an 
inverting amplifier using feedback. We note that the distortion at low and moderate fre- 
quencies is low and almost constant. Then, around 10 kHz the two harmonic amplitudes 
suddenly start increasing at a rate of 20 dB per decade. This sudden increase is caused 
by capacitances inside the amplifier. In Hernes (2003) a thorough nonlinear analysis of 
the feedback-folded cascade Miller OPAMP is carried out. The authors presented closed- 
form equations for the second and third harmonics as a function of frequency and circuit 
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Frequency (Hz) 

Figure 5.2 Measured HDi and HD? of a CMOS two-stage Miller-compensated operational ampli- 
fier configured as an inverting amplifier using feedback. Reproduced from Wambacq P., Gielen 
G. G. E., Kinget P. R., and Sansen W. (1999) High-frequency distortion analysis of analog inte- 
grated circuits, IEEE Trans. Circuits Syst. II: Analog Digital Signal Process., 46(3), 335-345 (© 
1999 IEEE) 

element parameters. Also, an optimization procedure for low linearity in feedback circuits 
was described. 



5.4 CALCULATION OF DISTORTION IN ANALOG 
CMOS CIRCUITS 

In the previous sections, we have explained how to calculate the distortion characteristics 
of single transistors. In this section, we will briefly discuss important methods commonly 
employed to calculate distortion in analog CMOS circuits, such as OPAMPS, transcon- 
ductors, and RF amplifiers. The methods employed are based on either numerical or 
symbolic techniques. 

Probably the most straightforward numerical method for the calculation of harmonic 
components of circuit responses is to ran a time-domain simulation in, for example, SPICE 
(Quarles et al. 1994) and then perform a Fast-Fourier transform (FFT) to calculate the 
spectrum of the output signal(s). This approach has several disadvantages. Usually the 
circuit exhibits transients at the beginning of the simulation before it reaches steady state. 
The sequence of time points that are used as input to the FFT algorithm must be free of 
transients, and hence, the time-domain simulation must be run at least one period of the 
output signal plus the time required to reach steady state. For circuits that exhibit widely 
separated time constants, the transients remain active for a long time and the CPU time 
for a single simulation may become uncomfortably long. Example of such circuits are 
high-Q filters and RF mixers. 
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Two methods that do not suffer from the above problems are the shooting method 
(Aprille and Trick 1972; Skelboe 1980) and the harmonic balance (HB) method (Cun- 
ningham 1958; Mees 1981). In the former method, several time-domain simulations are 
run over a single period. On each iteration the initial conditions are varied to match the 
response at the end of the simulation interval. When they match, the initial conditions 
found do not cause transients and the steady state response of the circuit is found. 

The HB method is a frequency-domain technique in which the steady state response of 
the circuit is calculated directly. Today the most common frequency-domain technique for 
steady state analysis implemented in circuit simulators is the direct calculation of the coef- 
ficients of a trigonometric series (Kundert et al. 1990). Applying this method to circuits 
containing only linear devices is equivalent to a phasor analysis, in which superposition 
is utilized for each device. However, the introduction of nonlinear devices complicates 
matters. Superposition does not apply any more, and even worse, the calculation of the 
coefficients of a single device response is no longer a simple task of directly transforming 
the coefficients of the input stimulus. A common solution to this problem is to calculate 
the response coefficients approximately by transforming the stimulus, the node voltages, 
and the branch currents into the time domain, where all nonlinear devices can easily be 
evaluated. The results are then converted back into the frequency domain in order to 
obtain the coefficients of the response. 

Frequency-domain techniques based on a direct calculation of trigonometric series 
coefficients are referred to as harmonic balance methods (see, for example, the review 
articles by Rizzoli and Neri (1988); Gilmore and Steer (1991a, 1991b)). The first term 
(harmonic) is obvious because trigonometric series are used. The second term (balance) is 
mostly for historical reasons. It stems from an approach based on balancing the currents 
and voltages at the boundaries between linear and nonlinear subcircuits. 

All methods discussed above are numerical and usually do not present the results in a 
way such that designers can extract the devices that dominate the nonlinearity characteris- 
tics of the circuit. Numerical methods based on so-called Volterra series (see, for example, 
Schetzen (1980)) that provide such insight have been proposed by many researchers 
(see, for example, Weiner and Spina (1980)) and have also been implemented in SPICE 
(Chisholm and Nagel (1973); Roychowdhury (1989)). A disadvantage of methods based 
on Volterra series is that they require the circuits to be weakly nonlinear. Weakly non- 
linear means that the input amplitudes should be small. As the amplitudes are increased, 
the analysis based on the weakly nonlinear assumption becomes less and less accurate. 
Since it is generally not possible to estimate for what amplitudes the assumption fails, 
other numerical methods are used to verify the results. 

Symbolic methods are also usually based on Volterra series, and as mentioned above, 
are accurate only for weakly nonlinear circuits. Now, you may ask why we need symbolic 
methods after all. The answer is that the results obtained from symbolic methods give 
much better insight into the nonlinear characteristics of the circuits since such methods 
provide closed-form analytical expressions containing contributions of basic nonlinearities 
that are dominant in the harmonic or intermodulation amplitudes. As a result, it is possible 
to easily identify which devices dominate the nonlinear response and how to modify sizing, 
biasing, and topology to reduce the distortion components. In general, this is not possible 
in numerical methods. Work published on symbolic analysis concentrate on distortion in 
OPAMPS. In Wambacq et al. (1999), the authors present an approach based on Volterra 
series for the analysis at high frequencies of the nonlinear behavior of weakly nonlinear 
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circuits with one input port, such as amplifiers, and with more than one input port, such 
as analog mixers and multipliers. The complexity of the representation of frequency- 
dependent distortion based on Volterra series grows rapidly with the size of the circuit. In 
Hernes (2003), the authors present a more effective method based on the phasor method 
(see, for example Wambacq and Sansen (1998)) instead of Volterra series. By using this 
approach, the authors showed that the complexity of the representation of distortion in 
CMOS circuits can be drastically reduced compared to previous methods. 

In general, circuits exceeding a few transistors, symbolic hand calculation are not feasi- 
ble. Instead, symbolic network analysis programs are utilized to automate the calculations. 
Both programs developed in commercially available symbolic software packages, such 
as Mathematica ( http://www.woIfram.com/ ) and Maple ( http://www.maplesoft.com/ ), and 
stand-alone software programs have been developed. In Hernes (2003), Maple was used 
for automating the symbolic calculations. The computer program ISAAC by Gielen et al. 
(1989) is an example of a stand-alone program that requires no other software packages to 
be installed on your computer system. A copy of the program can be obtained by visiting 
their Web site at http://www.esat.kuleuven.ac.be/micas/Software/ . 
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The BSIM4 MOSFET Model 



6.1 AN INTRODUCTION TO BSIM4 

Since MOSFET was invented, many physical effects have been studied, such as (1) vertical 
and lateral nonuniform doping effect (Hori et al. (1993)); (2) drain-induced barrier-low- 
ering effect (Troutman (1979)); (3) normal and reverse short-channel effect (Viswanathan 
et al. (1985); Hsu et al. (1991); Rafferty et al. (1993)); (4) normal and reverse narrow 
width effect (Akers et al. (1982); Cheng et al. (1997b)); (5) field-dependent mobility and 
velocity saturation (Sodini et al. (1984); Talkhan et al. (1972)); (6) channel-length mod- 
ulation (Frohman-Bentchknowsky and Grove (1969)); (7) impact ionization (Arora and 
Sharama (1991)); and (8) polysilicon gate depletion (Huang et al. (1993)). These well- 
known physical effects have been included in various MOSFET models such as BSIM3v3 
model (Cheng et al. (1997a)). As CMOS technology approaches more advanced levels, 
many novel physical effects appear, such as (1) gate-induced drain leakage (Chen et al. 
(1987)) and gate direct tunneling leakage (Majkusiak (1990)); (2) inversion layer quan- 
tization (King et al. (1997)); (3) finite charge layer effect (Fiu et al. (1999)); (4) HF 
influence of MOSFET parasitics (Cheng et al. (2002)); and (5) asymmetric source/drain 
resistance. In BSIM4 (Fiu et al. (2000)), these new physical effects are modeled. Also, 
modeling for some physical effects included already in BSIM3v3 have been revisited in 
order to improve the model accuracy. 

6.2 GATE DIELECTRIC MODEL 

In today’s MOSFETs, in which the gate oxide thickness is very thin (<3 nm), the effect of 
channel quantization, determining the finite charge layer thickness (FCFT) in the channel, 
becomes nonnegligible (King et al. (1998)). BSIM4 accounts for this effect in both DC 
and capacitance model. To activate this effect of FCFT in the simulation, two of the 
following three can be used as input model parameters: (a) electrical gate oxide thickness 
TOXE, (b) the physical gate oxide thickness TOXP, (c) and their difference DTOX — 
TOXE — TOXP. On the basis of these parameters, BSIM4 models the FCFT effect by 
introducing an effective gate oxide capacitance C oxe ff in I-V and C—V models (Fiu 
et al. (1999)), 

„ C oxe ‘ C C en ,, , . 

Uoxeff — "T; — (O.ij 
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where C OX e and C ce n are called effective gate oxide capacitance and centroid channel 
charge capacitance, respectively, and can be given as follows: 



Ooxc 

6-cen 



EPSROX ■ s 0 
TOXE 

£si 

^DC 



( 6 . 2 ) 

(6.3) 



where EPSROX is a model parameter, with a default value of 3.9, for the gate dielectric 
constant relative to vacuum; TOXE is the parameter for the equivalent electrical gate 
oxide thickness; e 0 and e s j are the permittivity of free space and silicon, respectively; 
X DC is the equivalent DC centroid of the channel charge layer and is given by 



^dc — 



1.9 x 10 9 cm 



Vgsteff + 4 ■ ( 1/77/0 - VFB - 0 s ) \ 
2 ■ TOXP ) 



(6.4) 



where VTHO and VFB are model parameters for the long-channel threshold voltage at 
V BS = 0 and the flat-band voltage, respectively; TOXP is the parameter for the equivalent 
physical gate oxide thickness; Vgsteff is similar to the one used in BSIM3v3 but introduces 
additional parameters to improve the model accuracy in moderate inversion region. 



V, 



gsteff — 



n ■ V t ■ In 


1 + exp 


'm*(V gse - Vh)! 




n ■ V t 


m* + n ■ C oxe I 


2$ 




r (1 -m*)(V gse - V th )- V 0 ' ff ] 


q ■ NDEP ■ e si 


n ■ V t 



(6.5) 



where m* = 0.5 + arctan(MINV)/n, MINV is a model parameter introduced to improve 
the model accuracy in moderate inversion region; n is the subthreshold swing parameter; 
Vg Se is the effective gate voltage with the consideration of polysilicon gate depletion effect 
(Cheng et ai. (1997a)); V th is the threshold voltage; V ( ' ff is a potential offset parameter 
(Cheng and Hu (1999)) and equals to VOFF + VOFFL/L e ff to describe the channel-length 
dependence of Vff in devices with nonuniform doping profiles; L eff is the effective channel 
length; V t is the thermal voltage and is equal to k^T/q, <J> S can be considered the surface 
potential defined as 



k y J / NDEP \ 

0 S = 0.4 + — In + PHIN 

<1 V «i / 



( 6 . 6 ) 



where NDEP is doping concentration at the edge of the channel depletion layer at V B s = 0, 
n, is the intrinsic carrier concentration in the channel region, and PHIN is a model 
parameter to describe the nonuniform vertical doping effect on surface potential. 

High-k gate dielectric can be modeled as SiCH (relative permittivity: 3.9) with an 
equivalent SiCH thickness. For example, 3-nm gate dielectric with a dielectric constant 
of 7.8 would have an equivalent oxide thickness of 1.5 nm. BSIM4 also allows the user 
to specify the model parameter (EPSROX) for gate dielectric constant different from 3.9 
(SiCH) as an alternative approach to modeling high-k dielectrics. Figure 6.1 illustrates the 
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Figure 6.1 Illustration of the options for gate dielectric models in BSIM4 

algorithm and options for specifying the gate dielectric thickness and calculation of the 
gate dielectric capacitance for BSIM4 model evaluation. 



6.3 ENHANCED MODELS FOR EFFECTIVE DC 
AND AC CHANNEL LENGTH AND WIDTH 

Two additional fitting parameters XL and XW are introduced in BSIM4 to account for 
the offset in channel length and width due to the processing factors such as mask and 
etching. In DC case, the effective channel length and width can be given 



L e ft — ^ drawn T XL 2 ■ (1L . 

LW 



d L 

Weff — ITdrawn 



LL 

LINT + —777 + 



WL 

dW = WINT+ —— + 



l_LLN \yLWN 

+ XW — 2d W, 

ww 



+ 



LWL 



ILLN \yLWN ’ 

WWL 
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]JVLN 1 yyWWN ' j^WLN yyWWN 



+ DITGVgsteff + DWB 



| \ T Thscff 



(6.7) 

( 6 . 8 ) 

(6.9) 



( 6 . 10 ) 



where LINT, LL, LW, LWL, LLN, and LWN are model parameters to describe the geometry 
dependence of d L , in which LINT can be considered as the traditional “A L” extracted 
from the intercept of a straight line in an i? DS versus Ldrawn plot; WINT, WL, WW, WWL, 
WLN, and WWN are model parameters to describe the geometry dependence of AW, in 
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which WINT can be considered as the traditional “AW” extracted from the intercept of 
a straight line in a 1/ifos versus Wdrawn plot; Wdrawn here and in Eqs. (6.13) and (6.15) 
stands for a channel width for a single-finger device. In a multifinger device, Wd raw n is the 
channel width per finger, that is, Wdrawn = WC^j/Ap, where W tot ai is the total width of the 
device and iV F is the finger numbers of the device. The DWG and DWB parameters are 
used to account for both the gate and substrate bias effects; Vg S t e ff and Vb se ff are effective 
gate and substrate biases. Figures 6.2 and 6.3 illustrate the definitions of XL , XW, dW, 
dL, W e ff, L e ff and /Vp. 

In AC case, the effective channel length and width are represented by L ac tive and W act i V e, 
defined as 



Tapfivf. — L 



drawn 



+ XL - 2 ■ dLC, 



LLC LWC LWLC 
dLC — DLC + —7777 + 77777777 + 



J^LLN ' yyLWN ' j^LLN \yLWN ' 



(6.11) 

(6.12) 
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Figure 6.2 Definitions of XL, XW, dW, dL, W e fj. and L e ff 




Figure 6.3 Definitions of iV F , Wd raw n. and W to t a i 
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Figure 6.4 Definitions of DWJ and Weffcj 



Wacive = W drawn + ZW - 2dWC, 

WLC WWC WWLC 
dwr = DWC A 1 1 

1 j^WLN 1 yyWWN 1 j^WLN y/WWN ’ 



(6.13) 

(6.14) 



where DLC, LLC, LWC, and LWLC are model parameters to describe the geometry depen- 
dence of dLC, which is similar to d L in DC but is a closer representation of the parasitic 
portion of the physical channel length at one side of the device than dL; DWC, WLC, 
WWC , and WWLC are model parameters to describe the geometry dependence of dWC. 

Please note that the calculation of dLC and dWC are triggered by including DLC and 
DWC or the length/width dependence parameters (LLC, LWC, LWLC, WLC, WWC, and 
WWLC) in the model file. As a default, DLC = LINT, DWC = WINT, and DWC, DLC, 
LLC, LWC, LWLC, WLC, WWC, and WWLC are set to the values of their DC counterparts. 

As shown in Figure 6.4, BSIM4 also introduced a new parameter DWJ to calculate the 
effective source/drain diffusion width, Weffcj, in modeling parasitics such as source/drain 
series resistance, gate resistance, gate-induced drain leakage (GIDL), and so on. 



Wef&j = W drawn - 2 ■ dWL, 

„ / WLC WWC 

— Wfrawn — 2 ■ DWJ + —777777 + 7777777777 + 



WWLC 



j^WLN 1 ppWA ' LWLNyyWWN 



(6.15) 



6.4 THRESHOLD VOLTAGE MODEL 

6.4.1 Enhanced Model for Nonuniform Lateral Doping 
due to Pocket (Halo) Implant 

To reduce the short-channel effects (SCE), local high-dose implantation near the source/ 
drain region edges have been employed. This is called lateral channel engineering or 
pocket (Halo) implantation, which causes higher doping concentration near the source/ 
drain junctions than that in the middle of the channel as shown in Figure 6.5. A V th 
roll-up, sometimes called reverse short-channel effect (RSCE), will usually happen as 
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Figure 6.5 Doping profile along the channel in a MOSFET with pocket implant 




Figure 6.6 Illustration of V [lt roll-off effect. Reproduced from Lu, C.-Y. and Sung J. M. (1989) 
Reverse short-channel effects on threshold voltage in submicrometer salicide devices, IEEE Electron 
Device Lett., 10, 446-448 



the channel length becomes shorter since the effective channel doping concentration gets 
higher (see Figure 6.6). The pocket implantation will not only cause nonuniform lateral 
doping (NULD) but will also contribute to a nonuniform vertical doping, so the body 
effect will be influenced as well. A simple equation to consider the lateral nonuniform 
doping has been derived in BSIM3v3 (Cheng etal. (1997a)). It can model the V* roll- 
off reasonably well as shown in Figure 6.7. An empirical term has been introduced in 
BSIM4 to improve the model accuracy with the consideration of the influence of NULD 
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Figure 6.7 Comparison of measured data and BSIM3v3 model to simulate the Vih roll-off 
effects. Reproduced from Cheng Y., Sugii T., Chen K., and Hu C. ( 1997b) Modeling of small size 
MOSFETs with reverse short-channel and narrow width effects for circuit simulations, Solid-State 
Electron., 41(9), 1227-1231 

to the body effect, 



V* = VTHO + K 1 • (J 



<t> s — ^BS 




K2 ■ V BS 



+ K\ ■ 




LPEO 

Uf 




■ 



(6.16) 



where VTHO , K I , K 2, LPEO , and LPEB are model parameters. LPEO replaces the NLX 
parameter in BSIM3v3 and LPEB is a new parameter introduced in BSIM4. 

Besides the implants discussed above, it has been known that the pocket implant also 
causes so-called drain-induced threshold voltage shift (DITVS) in long-channel devices, 
which is described by the following equation in BSIM4: 



A Vth (DITVS) = —n ■ V, ■ In 



J eff 



L eff + DVTPO ■ (1 + e 



-DVTP 1 ■ 



Vds) 



(6.17) 



where DVTOO and DVTPl are model parameters, L e ff is the effective channel length, and 
Vds is the drain-to- source bias. 



6.4.2 Improved Models for Short-channel Effects 

Compared with the long-channel devices, short-channel devices show a significant depen- 
dence on channel length and drain voltage. Various models have been developed to 
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describe the so-called short-channel effects. Typically, it can be summarized by the fol- 
lowing equation: 

AV*, SCE = AV th (L eff ) + AVth(V DS ) (6.18) 

where A Vth ( L cff ) is the change in threshold voltage caused by the SCE without the 
influence of drain/source voltage V DS and AV lh (V DS ) is the change in threshold voltage 
caused by the influence of Vds- 

In a short-channel device with nonzero Vds, the depletion layer in the channel region 
will be modulated as drain bias varies, as shown in Figure 6.8. On the basis of a quasi- 
two-dimensional analysis of the Poisson equation, the influence of SCE on the threshold 
voltage can be described by the following equation (see Liu et at. (1993)): 



AV th , SCE = -0 th (L eff )[2(V bi - <D S ) + V DS ] (6.19) 



where V b i is the built-in voltage of the source/drain junctions and 6 rb ( L ett j is given by 



6th ( Left) — 



1 



2 cosh 




- 1 



( 6 . 20 ) 



where l t is called the characteristic length (see Cheng el al. (1997a)). 

Unlike BSIM3v3, in which an approximated form of Eq. (6.20) has been used (Cheng 
et al. (1997a)), BSIM4 implemented Eq. (6.20) without approximation. However, a set 
of fitting parameters similar to those in BSIM3v3 has been introduced to increase the 
model flexibility for different technologies, such as DVTO , DVT I , DVT2, DSUB. ETAO, 
and ETAB , as shown in the following model equations: 



AW SCE = - 



where 



DVTO 



2 ■ cosh (dVTI ■ j - 1 



(Vm - 3> s ) - 
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Figure 6.8 An increase in depletion layer caused by the applied drain bias 
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■ TOXE ■ Z dep 
EPSROX 



+ DVT2 ■ Vbs)> 



ho = 




• TOXE ■ Z dep0 
EPSROX 



(6.23) 

(6-24) 



where X dep is depletion layer width in the channel with the influence of V bs ; X dep is 
depletion layer width in the channel when V B s = 0, 
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(6.25) 

(6.26) 



Figure 6.9 shows the model results with SCE only, with NULP effect, and with both 
SCE and NULP effects. It clearly demonstrates that the V* roll-off is a combined result, 
contributed by both SCE and NULP. 



6.4.3 Model for Narrow Width Effects 

Narrow width effect was found initially in devices with LOCOS isolation, in which addi- 
tional contribution of charges in the depletion region in the edge of the held implant 
region will impact the threshold voltage of the device as shown in Figure 6.10. As the 
channel width decreases, the influence becomes significant, resuiting in a higher threshold 
voltage. This behavior is called “normal” narrow width effect. In contrast with the “nor- 
mal” narrow width effect, the so-called “reverse” narrow width effect has been reported 
in devices with shallow trench isolation (STI) as illustrated in Figure 6.11. A decrease in 
threshold voltage has been observed as the channel width decreases. 




Figure 6.9 Illustration of BSIM4 threshold voltage model in predicting different characteristics 
of Vth versus L in MOSFETs with different channel/pocket doping conditions 
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Polysilicon 




Figure 6.10 Illustration of MOSFET with LOCOS isolation 




Figure 6.11 Illustration of MOSFET with STI (shallow trench isolation) 



To model narrow width effects, similar to BSIM3v3, BSIM4 adopts an empirical 
approach based on the following general form for narrow width effects: 



. T , x OX T 

AV t h,w oc ——Os. 
W eff 



(6.27) 



By introducing several fitting parameters, K3, K 3 B , WO, DVTOW , DVTIW and 
DVTIW, the narrow width effect is modeled in BSIM4 by 
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Figure 6.12 Calculated result of BSIM3v3/BSIM4 V th model in predicting characteristics of V* 
versus channel width with extracted different model parameters 



where the second term accounts for the narrow width effect in small devices with both 
narrow width and short-channel length. 

Figure 6.12 shows the calculated result of threshold voltage versus channel width with 
different model parameters for both “normal” and “reverse” narrow width effects. 



6.4.4 Complete Threshold Voltage Model in BSIM4 

The complete BSIM4 threshold voltage model is 
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(6.29) 
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where 



TOXE 

K lox = K 1 , 

TOXM 

TOXE 

K lm = K 2 , 

TOXM 

Vbseff = V bc + 0.5 • [(Vbs - Vbc - Sr) + V (Vbs — Vbc — <5 i) 2 — 4 ■ <$i ■ Vbc, 



(6.30) 

(6.31) 

(6.32) 



where <5i = 0.001, Vbc is the upper bound of Vb se ff and is defined by the following equation: 

( K\ 2 \ 

y bc = 0 - 9 -(^-^j- (633) 

The temperature dependence of V* is modeled by 

/ KT\L \ ( T \ 

Vth (T) = V th (TNOM) + hm + -j— + KT2 ■ V bseff j ■ ~ 1 j (6-34) 

where ATI and ATI L and KT2 are model parameters. 

Figure 6.13 illustrates the dependence of V t h on temperature. 



6.5 CHANNEL CHARGE MODEL 

BSIM4 uses a unified expression for the channel charge Q cb from the strong inversion 
and subthreshold regions. The equation of the unified charge model in BSIM4 is similar 
to that in BSIM3v3, but additional improvements have been introduced to enhance the 
model accuracy in the transition region from the subthreshold to strong inversion. With the 
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Figure 6.13 Calculated characteristics of V th versus temperature 
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consideration of the influence of both drain and body biases (the details of the derivation 
can be found in Cheng and Hu (1999); Cheng et al. (1998)), the channel change density 
along the channel, at a location y, can be described by 



Qch(y) — Coxeff ‘ Cgxtefr * 



VKy) 

v b 



(6.35) 



where C oxe ff and Vg S teff have been discussed earlier. Vr(y) is the quasi-Fermi potential 
at given point y along the channel with respect to the source; the expression of Vj, is 
given by 



V b 



Vgsteff + 2 ■ V t 
^bulk 



(6.36) 



where Ab u ik is a factor to describe the bulk charge effects and is expressed as 



Abuik — 1 + fi 



Aq ' Ceff 



NUD ' 



■V. 



gsteff ' 



L e ff + 2 • yJXJ • X dep 
2 

i + 



^eff 



, Ceff + 2 ■ JXJ ■ X deP/ 



(1 - AGS ) 



50 

Kn+B\ 



(6.37) 



1 



1 + KETA ■ Vbseff 



where AO, AGS, BO, 51, and KETA are model parameters; 5 NUD is used in BSIM4 to 
model the nonuniform doping effects, 



5nud = 



\/l + LPEB/ L e ffK iox 

- x/T, Vbseff 



+ K 2ox - K3B 



TOXE 

7 <t> s . 

w eS ' + wo 



(6.38) 



Equation (6.35) can become a well-used piecewise equation in strong inversion and 
subthreshold regions, respectively, as shown in Figure 6.14 (Cheng et al. (1998)). In 
strong inversion region, threshold voltage, V*, and bulk charge factor, Abuik, are the main 
factors determining the channel charge characteristics. In subthreshold region, besides 
these two factors, the subthreshold swing factor, n, and the potential offset parameter, V^ ff , 
are also important for influencing channel charge and hence the subthreshold conduction. 

Both Voff and n have been mentioned in previous section. The expression of Vj ff 
has been introduced earlier also when discussing Vg Ste ff ■ n is subthreshold conduction or 
subthreshold swing factor. Basically, it is a function of the channel length and the Si-SiCT 
interface property. In BSIM4, the n factor is given by 



n = 1 + NFACTOR ■ — + 

Coxe C, 



err , ( cdsc + c riser) ■ v DS + cdscb ■ v bseif ) ^ 

1 7 7 7 (6.39) 



2 ■ C oxe ■ cosh ( DVTl ■ — ) - 1 

It 



where NFACTOR, CIT, CDSC, CDSCD, CDSCB are model parameters. The second term 
is the portion of the subthreshold swing factor derived for long-channel devices, with 
NFACTOR introduced. The third term is to model the contribution of the interface state. 
The last term is to describe the contribution of the coupling between drain/source and 
channel as shown in Figure 6.15. The capacitance coupling effect is described by an 
exponential function of the channel length. CDSCD and CDSCB are model parameters 
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Figure 6.14 Channel charge model fits the measurement data taken from devices with different 
lV C h well. The model covers weak, moderate, and strong inversion regions of nMOSFETs. Repro- 
duced from Cheng Y. et al. (1998) A unified MOSFET channel charge model for device modeling 
in circuit simulation. IEEE Trans. Computer-Aided Design Integrated Circuits Syst.. 17, 641-644 
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Figure 6.15 Illustration of the gate oxide capacitance, channel depletion capacitance, and Cd sc 
coupling capacitances 
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introduced to describe the drain and body-bias dependence of the capacitance coupling 
effect. Note that it shares the DVT\ parameter for DIBL effect in threshold voltage model 
to describe the length dependence. 



6.6 MOBILITY MODEL 



Three scattering mechanisms, phonon scattering, coulomb scattering, and surface rough- 
ness scattering, have been found to explain the mobility behavior and account for the bias 
and geometry dependence of mobility. Each mechanism may be dominant under some 
specific conditions of the doping concentration, temperature, and biases. For example, 
mobility depends on the gate oxide thickness, channel doping concentration, thresh- 
old voltage, gate and bulk biases. A unified mobility model is given by Sabnis and 
Clemens (1979): 



_ Mo 
Meff ~ 1 + (E eS /E 0 y 



(6.40) 



where // o is the low-field mobility, Eq is called critical electric field, v is a constant, 
the value of which depends on the device type and technology. £ e ff is an effective field 
defined empirically by 



j-, Qb T Qinv/2 
E e f[ = 



(6.41) 



£si 



where Qn and <2 inv are the charge density in the bulk and in the channel, respectively. 

BSIM4 provides two mobility models that have been used in BSIM3v3. In addition, it 
also introduces a new mobility model based on the universal model given in Eq. (6.40). 
Users can select a different mobility model by defining a model parameter, mobMod, in 
the model parameter hie (also called model card). When mobMod — 0 and 1, mobility 
models from BSIM3v3 are used. When mobMod — 2, the universal mobility model is 
used. The detailed equations are given as follows: 

When mobMod = 0, 



Meff = 

1 + (U A (T) + U C (T ) ■ Vbseff) ‘ 



When mobMod — 1, 



Uo(T) 

/ Vgsteff + 2 ■ Vth \ 

V TOXE J 



+ U b (T) ■ 



{ Vgsteff + 2 ■ Vth \ 

V TOXE ) 
(6.42) 



Meff 



I ' 



Ua(T) ■ 



Uo(T) 

( Vgsteff + 2 ■ Vth \ 

V TOXE J 



+ U b (T) ■ 



( Vgsteff + 2 ■ V* 

V TOXE 



2 ' ( 1 T U c (T) Vbseff) 



(6.43) 



When mobMod — 2, 



Meff : 



U 0 (T) 



1 + (Ua(T) + Uc(T) ■ Vbseff) 



Vgsteff 



+ Co ■ (VTHO - VFB - O s ) 



-,EU 



(6.44) 



TOXE 
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Figure 6.16 V GS dependence of mobility behavior of different mobMod options 




Temperature (K) 

Figure 6.17 Calculated characteristics of mobility versus temperature 

where Co is a constant and it is 2 for ttMOS devices and 2.5 for /?MOS devices; UO, UA, 
UB, UC, and EU are model parameters. 

Figure 6.16 shows the calculated mobility behavior for different mobMod options to 
demonstrate the bias dependence of the mobility model when selecting different mobility 
model options. They approach a constant as Vgs tends to V* even though this number is 
not exactly equal to Uq. 
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The temperature dependence 



U 0 (T) 
U a (T ) 
U b (T) 
U C (T ) 



of mobility is described by the following equations: 

UTE 



= C/0- 



r y 

tnom) 



= UA + UA 1 ■ ^ ^ 



= UB + UB 1 ■ 
= C/C+ C/Cl ■ 



TNOM 

T 

TNOM 
T 

TNOM 



- 1 



- 1 



(6.45) 

(6.46) 
(6-47) 
(6.48) 



where UA, UB, and C/C are model parameters to describe the mobility behavior at tem- 
perature TNOM. 

Figure 6.17 shows the calculated characteristics of mobility versus temperature at sev- 
eral different V G s- 



6.7 SOURCE/DRAIN RESISTANCE MODEL 

The total source and drain series resistances in a MOSFET used in integrated circuit (IC) 
designs have several components such as the via resistance, the salicide resistance, the 
salicide-to-salicide contact resistance, and the sheet resistance in LDD region, as shown in 
Figure 6.18. However, the contact and LDD sheet resistances usually dominate the total 
resistance. The typical value of the sheet resistance is around 1 kO/sq in LDD region for 
a typical 0.25-p.m CMOS technology and much smaller in more advanced technologies. 

It has been known that the source/drain resistances are bias-dependent. In some compact 
models such as BSIM3v3 (Cheng et al. (1997a)), these bias dependencies are included. 




Figure 6.18 Different resistance components for the source/drain parasitic series resistances 
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However, since these parasitic resistances in BSIM3v3 are treated only as virtual compo- 
nents in the I-V expressions to account for the DC voltage drop across these resistances, 
they are invisible to the signal in the AC simulation. Therefore, external components for 
these series resistances need to be added outside an intrinsic model to accurately describe 
the HF noise characteristics and the input AC impedance of the device (Enz and Cheng 
(2000)). In BSIM4, similar to that in BSIM3v3, the source/drain series resistances are 
modeled by a bias-independent component and a bias-dependent component. However, the 
users have the option to select the source/drain resistance model according to their model 
applications to decide where to put the source/drain series resistances. In other words, in 
addition to the resistance model in BSIM3v3, which embeds the source/drain resistance in 
the I-V equation and assumes that series resistance at the source side equals the one at the 
drain side, BSIM4 introduces an asymmetric source/drain resistance model, which allows 
that the bias-dependent resistances at the source and the drain do not have to be equal 
and that they are physically connected between the external and the internal source/drain 
nodes, as shown in Figure 6.19. This asymmetric external source/drain resistance model is 
needed in simulating the high-frequency small-signal AC and noise behavior. A param- 
eter called rdsMod is introduced in BSIM4 to select different source/drain resistance 
models. When rdsMod — 0, symmetric source/drain BSIM3v3 resistance model is used 
and when rdsMod — 1, the external asymmetric source/drain resistance model is selected. 
The detailed equations are given below. 

When rdsMod = 0, 



^DS 



Rdswmin (T) + Rdsw ( T ) 



PRWB ■ (V 3 > s - V bS eff - V^) + 
CIO 6 ■ Weflcj)"* 



1 

1 + PRWG ■ V gs ^. 



(6.49) 

where PRWB, PRWG , and WR are model parameters; Rdswmin (T) and Rdsw(T) are the 
parameters with temperature dependence, given in the following: 



R dsw (T) = rdsw + prt ■ (— 1 ) , 

\TNOM ) 

Rdswmin (T) = RDSWMIN + PRT ■ ( — 1 

V TNOM 



(6.50) 

(6.51) 



where RDSWMIN and RDSW are model parameters to describe the source/drain resistance 
at TNOM\ PRT is a model parameter for the temperature dependence. 

When rdsMod — 1, 



Rdwmin(T) + Rdw(T ) 



Pn = 



-PRWB ■ Vbd + 



1 



1+/WG-(V gd - V fbso). 



Rswmin(T) + Rsw(T) 



Rs = 



N F ■ (10 6 ■ Weffej) 
-PRWB ■ Tbs + 



WR 



1 



l + Pi?WG-(V GS - Vfbso). 



N f ■ ( 10 6 ■ W effcj ) TO 



(6.52) 



(6.53) 
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Figure 6.19 (a) Intrinsic model without any series source/drain resistance; (b) symmetric source/ 

drain series resistance model rdsMod = 0; and (c) asymmetric external source/drain resistance 
model in BSIM4 ( rdsMod = 1) 



where VfbSD is the calculated flat-band voltage between the gate and the source/drain 
diffusion regions. 



VfbSD = 



KvT , 

In 

<7 



/ NGATE\ 
V NSD ) 



(6.54) 



where NGATE and NSD are model parameters for the doping concentration in the gate 
and source/drain regions. 
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In addition, diffusion source drain resistance 7? sdif f and ^ddiff are also modeled to 
account for the layout difference in source/drain regions in different MOSFET applica- 
tions. A parameter rgeoMod is introduced to turn on/off the components R s dig- and /Cciiti 
in simulation. When rgeoMod = 0, R s dif f and are set to zero; if rgeoMod — 1, 

^sdiff an d /'’ddiff will be calculated differently, depending on which geometry-dependent 
parameters are given. We will discuss some details later when introducing the layout- 
dependent parasitics. 

The temperature dependence of the RDWMIN, RSWMIN, RDW, and RSW are given 
by the following equations: 

R DW (T) = RDW + PRT ■ ( — 1 | , (6.55) 

\TNOM ) 

R SW (T) = RSW + PRT- ( — — 1), (6.56) 

\TNOM ) 

Rdwmin (T) = RDWMIN + PRT ■ ( - lj , (6.57) 

Rswmin (T) = RSWMIN + PRT • ( - - lj . (6.58) 



6.8 I-V MODEL 

The details of the derivation of the I-V model can be found in Cheng and Hu (1999), 
Huang et al. (1994), and Cheng et al. (1997a). Here, we focus our discussion on under- 
standing of the I-V model with different source/drain resistance options, without giving 
the derivation details of the I-V model equations. Depending on which resistance model 
is used in the model ( rdsMod — 0 or 1, and /?ds(V) = 0 or Rds(V) ^ 0), the I-V 
model and implementation are different. When Rds(V) — 0, the I-V model is used 
only for an intrinsic device without series source/drain resistance. Since people usu- 
ally do not use this option when extracting model parameters, we do not discuss this 
option in detail and only give a brief discussion when we discuss the I-V model, 
when rdsMod = 1, where the device is modeled by an intrinsic device and the external 
source/drain series resistance components are modeled by introducing two more nodes as 
shown in Figure 6.19(c). 



6.8.1 I-V Model When rdsMod = 0 (R D S (V) # 0) 

When rdsMod — 0 and A’ds ( V ) ^ 0, the I-V model with so-called “virtual” series 
source/drain resistance components is used, in which the series source/drain resistance 
components are embedded in the I-V equation instead of “real” physical resistance 
components in the model implementation, so the impact of the source/drain resistance 
components is modeled in DC but not in AC and noise simulation (Enz and Cheng 
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2000). The complete single equation channel current model is given by 



I ds — 



I, 



dsO 



7?DS ' AlsO 



1 + 



1 + 



C, 



•In 



elm 



V A 

VasAT 



1 + 



Vds — Vdseff 

Vadibl 



Vdseff 



j Vds — Vdseff \ 
V ADITS ) 



^ Vds — Vdseff \ 

Vascbe ) 



(6.59) 



where the contributions of velocity saturation, channel-length modulation (CLM), drain- 
induced barrier lowering (DIBL), and substrate current -induced body effect (SCBE) to 
the channel current and conductance have been included. We will give some detailed 
discussion below. 

In Eq. (6.59), Am is the channel current for an intrinsic device (without including the 
source/drain resistance) in the regions from strong inversion to subthreshold, 



Weff ' Meff ' Coxeff ' Vgsteff ' ^dseff ' I 1 



Also = 



Vdseff \ 

2WW 



Teff ' ( 1 + 



Melt’ ' Vdseff 
2 ■ Vsat(T) • L e ff 



(6.60) 



where L cf f and W e ff are given by Eqs. (6.7) and (6.9); V d seff is introduced to ensure a 
smooth transition from triode to saturation region and is expressed as 

Vdseff = Vdsa, - \ ■ (Vdsat - V DS - DELTA + V ( V dS a, - V DS - DELTA ) 2 + 4 • DELTA ■ V dsa t) 

(6.61) 

where DELTA is a model parameter; Vdsat is the saturation voltage and is formulated as 



Vdsat 



— b — 'Jb 2 — 4 ■ a ■ c 
2 ■ a 



a ~ ^bulk ' Weft ' VS AT ■ C oxe ’ ^DS + Abulk ' 




b — — 



(Vgsteff + 2 ■ V t ) ■ 



/2 _ \ 2 ■ A bul k ■ VSAT ■ Z, eff 

\ A / Meff 



(6.62) 

(6.63) 



+ 3 ■ Abuik ■ (Vgsteff + 2 • V t ) ■ Weff ■ VSAT ■ C oxe ■ Rds , 



(6.64) 



2 • VSAT ■ L eff 

c = (Vgsteff + 2 ■ Vt) + 2 • (Vgsteff + 2 ■ V,) ■ Weff • VSAT ■ Coxe ’ Rds, 

Me ff 

(6.65) 

A = Al • Vgsteff + A2. (6.66) 
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As given in Eq. (6.59), BSIM4 models the channel current and conductance by using 
the Early voltage concept well known in bipolar device modeling. One assumption has 
been made to derive the model, that is, the contribution of each physical mechanism such 
as CLM, DIBL, SCBE, and so on to channel current and conductance can be calculated 
independently. Figure 6.20 illustrates the contribution of each physical effect to the output 
conductance (resistance). The details of the derivation and analysis can be found in Huang 
(1992), Cheng et al. (1997a), and Cheng et al. (1999); here we introduce briefly the model 
equations and discuss the enhancements of the output conductance model in BSIM4. 

The temperature dependence of saturation velocity is described by 

V sat (T) = VSAT-AT ■ (— 1 ) (6.67) 

\TNOM ) 

where VSAT is the model parameter for saturation velocity at TNOM and AT is a model 
parameter defining the temperature dependence of the saturation velocity. 

Va in Eq. (6.59) is the sum of the contribution of the Early voltage at the saturation 
voltage point and the contribution of the Early voltage from the CLM effect, 

Ea = Lasat + E A clm (6.68) 
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(6.70) 



(6.71) 

(6-72) 

(6.73) 



where PCLM, PVAG, FPROUT are model parameters introduced to improve the accuracy. 
The Early voltage contributed by DIBL effect is formulated as 



Vadibl — 



Vgsteff + 2 ■ Vt 



tfrout ■ (1 + PDIBLCB ■ Vbseff) 
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Kisat 
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(6.74) 

(6.75) 



where PDIBLC1, PDIBLC2, DROUT are model parameters that have been introduced in 
BSIM3v3. However, the cosh function has been used to replace the exponential function 
in BSIM3v3 for the length dependence of Vadibl- 

The Early voltage due to the contribution of the substrate current is calculated by 



Vascbe — 



Le{f e PSCBE 1 -Litl/ Vds - v dsat 

PSCBE2 



(6.76) 



where PSCBE 1 and PSCBE2 are model parameters. The dependence of Early voltage on 
SCBE is determined by Vdsat an d Litl built in the equation. 

An enhancement in the output conductance model in BSIM4 is the inclusion of the 
modeling of the drain-induced threshold shift (DITS) caused by pocket implantation. The 
influence of DITS on the Early voltage can be expressed as 



. . Tpocket 

ADITS ~ PDITS 



• [1 + (1 + PDITSL ■ L eff ) ■ c PDrrsl> ' V|K J 



where PDITS, PDITSL , and PDITLD are model parameters. 



(6.77) 



6.8.2 I-V Model When rdsMod = 1(R DS (V) = 0) 

When rdsMod — 1, the /-V characteristics of the device are described by a subcircuit that 
consists of an intrinsic portion and external series source and drain resistance components. 
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For the intrinsic device ( 7 ?ds ( V ) = 0), in the linear region (V DS < Vd Sat ), the 7-V 
equation has been given in Eq. (6.60) and rewritten as follows: 



hso — 



Weff ■ Meff ' Coxeff ' V gste ff • Vd se ff ■ ^1 — - — — ^ 



^eff ■ 



1 + 



Me ff ' V dse ff 
2 ■ Vsat(T’) ■ L t ff 



where V dsat is the intrinsic saturation voltage and is given by 



Vdsat — 
^sat ~ 



F S at ’ L ■ (Vgsteff T 2 ■ Vt) 

A bulk ‘ F sa t • L T Vgsteff T 2 ■ Vt 
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Meff 



(6.78) 



(6.79) 

(6.80) 



In saturation region, the Eqs. (6.70) to (6.77) are used, except that the parameter Tfos is 
set to zero. The model covers from strong inversion, through the weak inversion transition, 
to subthreshold region. 



6.9 GATE TUNNELING CURRENT MODEL 

As the gate oxide thickness is scaled down to 3 nm and below, gate leakage current due to 
direct carrier tunneling becomes significant. The carriers to conduct the gate leakage can be 
either electron or holes or both, depending on the type of the gate and the bias conditions. 

In BSIM4, the gate tunneling current is modeled by several different components, the 
tunneling current between the gate and the substrate ( / f jB ), the current between the gate and 
the channel ( lac), and the currents between the gate and the source/drain diffusion regions 
(7gs and 7 G d) (Liu et al. (2000)). 7 G c can be further partitioned between the source and 
the drain, that is, 7qc = 7gcs + 7gcd- Figure 6.21 illustrates these gate tunneling current 
components in an nFET. 

Depending on the model applications, the gate leakage model can be turned on or 
off by model selectors igcMod and igbMod. When igcMod = 1, BSIM4 calculates the 
contribution of 7 GC , 7 G s, and 7 G d in the simulation. When igbMod — 1, 7 G b will be 
calculated. When the selectors are set to zero in the model file, no gate tunneling currents 
are calculated in the simulation. 

6.9.1 Gate-to-substrate Tunneling Current /gb 

Depending on the bias conditions, devices can operate in accumulation or inversion 
regimes, in which the gate tunneling current 7 G b can be significant. 

In accumulation region, 7 G B,acc is determined by electron tunneling from conduction 
band and can be expressed as (Liu et al. (2000)) 

7oB,acc — lV c ff ■ 7, e ff ■ A ■ Toxratjo ■ Vqb ■ V aux • cxp[ B ■ TOXE 

■ ( AIGBACC - BIGBACC ■ V oxaC c) • (1 + CIGBACC ■ V oxacc )] (6.81) 
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Figure 6.21 Gate-to-source/drain/bulk tunneling components in a 



where the physical constants A — 4.97232 x 10 7 A/V 2 , B = 7.45669 
BIGBACC, and CIGBACC are model parameters, and 
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(6.82) 

(6.83) 

(6.84) 



where NIGBACC, TOXREF, and NTOX are model parameters; V oxacc is the voltage across 
the oxide of the device in accumulation and has a form of Eq. (6.84) to avoid the discon- 
tinuity of Voxacc from accumulation through depletion to inversion. Vfb Z b is the flat-band 
voltage calculated from zero-bias V* 

Vfbzb = Vth| v bs= y ds= o — Og — K 1 • (6.85) 

and 

^FBeff = Vfbzb ~ ~ ’ (Vfbzb — Vgb _ 0.02 + yj (Vfbzb ~ Vgb — 0.02) 2 + 0.08 ■ Vfbzb) • 

( 6 . 86 ) 

Depending on the bias conditions, devices can operate in accumulation or inversion 
regimes, in which the gate tunneling current 7 G b can be significant. 

In inversion region, is determined by electron tunneling from Valence band and 

can be expressed as 

fgb.inv — VVeff ‘ Eq ff ■ -^inv ' ^oxratio ' Vgb ’ Vaux,inv ' exp[— 5 inv • TOXE 

■ ( AIGBINV - BIGBINV ■ V oxinv ) ■ (1 + CIGBINV ■ V oxinv )l (6.87) 
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where the physical constants Ai nv = 3.75956 x 10 7 A/V 2 , B im — 9.82222 x 10 11 , AIG- 
BINV, BIGBINV, and CIGBINV are model parameters, and 



Vaux.inv = NIGBINV ■ Vt ■ In 
Vox.depinv — X ] ox v^x ~F Vgsteff 



1 + exp 



Voxdepinv EIGBINV \ " 
NIGBINV ■ V, ) 



(6.88) 

(6.89) 



where NIGBINV and EIGBINV are model parameters; Vox.depinv is the voltage across the 
oxide of the device in depletion and inversion regions and has a form of Eq. (6.89) to 
avoid the discontinuity of Vox.depinv from accumulation through depletion to inversion. 



6.9.2 Gate-to-channel and Gate-to-S/D Currents 

The gate-to-channel current in an «FET is determined by the electron tunneling from 
conduction band and the gate-to-channel current in a pFET is determined by the hole 
tunneling from valence band. The following equation can be used to describe the gate- 
to-channel current (/gc) i n both «FETs and /jFETs, 

I QC — lV e ff ■ 7. eft' ■ A • T OX ratio * Vg Se * V a ux,gc (6.90) 

■ exp[— £ • TOXE ■ (AIGC - BIGC ■ V mAtpinv ) • (1 + CIGC ■ V OX de P inc)] 



where AIGC, BIGC , and CIGC are model parameters; A — 4.97232 A/V 2 for nFETs 
and 3.42537 A/V 2 for pFETs, B = 7.45669 x 10 11 g 1/2 /(F 1/2 s) for nFETs and 1.6645 x 
10 12 g 1/2 /(F 1/2 s) for ;jFETs; and 



Vaux.gc = NIGC ■ Vt • In 



1 + exp 



/ V gsc - VTHO \ 
V NIGC ■ V t ) 



(6.91) 



where NIGC is a model parameter. 

In the above discussion, we did not account for the influence of the drain bias. With the 
consideration of the Vd S bias, Iqc is partitioned into two components between the drain 
and the source. The gate-to-channel tunneling current partitioned to the source, Iqcs, is 
given by 



PIGCD ■ V DS + exp (-PIGCD ■ V DS ) - 1 + 10' 4 
,GCS = ,0C {PIGCD ■ Vds) 2 + 2 ■ 10~ 4 



<6.921 



and the gate-to-channel tunneling current partitioned to the drain. / G cd , is given by 



, 1 - ( PIGCD ■ V DS + 1) ■ exp(— PIGCD ■ V DS ) + 1CT 4 

GCD _ GC (PIGCD ■ V DS ) 2 + 2 ■ 10~ 4 

where PIGCD is a model parameter whose default value is given by 



(6.93) 



B ■ TOXE ( _ Vdseff 

^gsteff 



PIGCD = 



2 • Vg S teff 



(6.94) 
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As mentioned earlier, in addition to the gate-to-channel tunneling current, gate tun- 
neling current between the gate and the source/drain region ( Iqs and /gd) should be 
accounted for in modeling of the gate tunneling, 



/(is = Weff ■ DLCIG ■ A ■ ToxratioEdge ' V G s ' Vos' ' exp [ — B ■ TOXE 

■ POXEDGE ■ ( AIGSD - BIGSD ■ V GS ') ■ (1 + CIGSD ■ V GS ')] 



and 



IgD = We ff ■ DLCIG ■ A ■ ToxratioEdge ‘ V G D ’ Gil)' ' CXp[— B ■ TOXE 

■ POXEDGE ■ (AIGSD - BIGSD ■ V GD ') • (1 + CIGSD ■ U GD ')J 



(6.95) 



(6.96) 



where DLCIG , AIGSD, BIGSD, CIGSD, and POXEDGE are model parameters; and 



* oxratioEdge — 



TOXREF 
TOXE ■ POXEDGE ) 



\NTOX 



1 

( TOXE ■ POXEDGE ) 2 ’ 



Vqs' — V (Vgs — VfbSD) 2 + 10 4 , 
Vgd' = V (Vgd — VfbSD) 2 + 10 



(6.97) 

(6.98) 

(6.99) 



where VfbSD is the flat-band voltage between the gate and the S/D diffusions and is given 
by the following equation: 



k B T ( NGAT E \ 

— -In . (6.100) 

q V N SD ) 

6.10 SUBSTRATE CURRENT MODELS 

The modeling of substrate current is important in today’s MOSFETs, especially for ana- 
log circuit design. In addition to the junctions diode current and gate-to-body tunneling 
current, two other current components dominate the contribution of the total substrate 
current in a MOSFET: one is the substrate current due to impact ionization of the channel 
current and the other is the gate-induced drain leakage (GIDL) current. Next, we will 
discuss the models for these substrate current components. 



6.10.1 Model for Substrate Current due to Impact Ionization 
of Channel Current 



The substrate current due to impact ionization of the channel current in BSIM4 uses the 
same model equation as that in BSIM3v3.2, 



ALPHA!) + ALPHA 1 ■ L eS 
lu — ( Eds — Vdseff) ■ exp 

Leff 



BETA!) \ 

/dsa (6.101) 

VDS - Vdseff/ 
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where ALPHAO and BETAO are model parameters for impact ionization coefficients; 
ALPHA 1 is also a model parameter that is introduced to improve the model scalability, and 



Gsa — 



fdsO 



1 + 



R 



1 + 



DS ' AlsO 

Vlscff 

Vds 



1 + 



Vlseft 



V A DIBL 



Gclm 



1 + 



■In 



Va 

Vasat 



Vds 



Vlset’f \ 

Vadits ) 



(6.102) 



where the detailed equations of 7 ds0 , tf ds , V dse f f , Vasat- C dm , V AD ibl, V AD its- and V A have 
been discussed previously when discussing the I-V model. 



6.10.2 Models for Gate-induced Drain Leakage (GIDL) 

and Gate-induced Source Leakage (GISL) Currents 

The models for GIDL and GISL currents can be formulated as (Liu et al. (2000)) 



/oidi. = AGIDL ■ ILcffCj 



Vds — Vgse — EGIDL 
3 ■ T nxe , 



/ 2 ■ T oxe ■ BGIDL \ V DB 3 

6XP V V r)S - V gse - EGIDL) ' CGIDL + V DB 3 ’ 



ffiTST. = AGIDL ■ W ; 



- Vds — V 



effCj ‘ 



gse 



EGIDL 



3 ■ 7L 



(6.103) 



3 • T nxc ■ BGIDL \ 

' ex P I T 7 TT EEArPT I ' 



Vsb 



V DS - Vgse - EGIDL ) CGIDL + V S b 



3 ’ 



(6.104) 



where AGIDL , BGIDL , CGIDL, and EGIDL are model parameters. CGIDL is introduced 
to account for the body-bias dependence of /qidl and /gisl- 



6.11 CAPACITANCE MODELS 

BSIM4 provides three options for selecting different capacitance models. With only a 
minor change in separating CKAPPA parameter into different parameters for gate-source 
and gate-drain overlap capacitances, BSIM4 maintains the model equations and model 
parameters in BSIM3v3. However, the option capMod = 1 in BSIM3v3 is no longer 
supported in BSIM4. So users should be aware of this fact when selecting the capacitance 
models in BSIM4 because the meaning of capMod parameter has been changed in BSIM4 
from that in BSIM3v3. In other words, when one selects capMod = 1 in BSIM4, the 
capacitance model is actually the one for capMod — 2 in BSIM3v3 and when one selects 
capMod — 2 in BSIM4, the capacitance model is actually the one for capMod = 3 in 
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BSIM3v3. The following table gives the correspondence between the capMod parameter 
and the capacitance models/parameters in BSIM3v3 and BSIM4. 

In this section, we will discuss the charge and capacitance models. The charge model 
is the basis of the capacitance model. The space charge of a MOS structure consists of 
three fundamental components: the charge on the gate electrode, Qa, the charge in the 
bulk depletion layer, Q b, and the mobile charge in the channel region, Ginv- Generally, 
the following relationship holds in a MOSFET: 

G g + Ginv + Qb — 0 (6.105) 

and 

Ginv = Gd + Gs (6.106) 

where Gd and Gs are the channel associated with the drain node and the source node, 
respectively. 

Capacitance between any two of the four terminals (gate, source, drain, and bulk) is 
defined as 



C u = ^ i^j; i,j = G,D,S,B, (6.107) 

dVij 

C U = ~ TT i = j> Uj = G,D,S,B. (6.108) 

dV u 

Depending on the bias conditions, the device can operate in accumulation, depletion, 
and strong inversion regions, which are further divided into linear and saturation regimes. 
By selecting different capMod options, different model equations can be used to describe 
the characteristics of the charge and the capacitances. 

6.11.1 Intrinsic Capacitance Models 

(1) Capacitance model option 1 (capMod = 0) 

When capMod — 0 is input in the model file, the piecewise long-channel charge and 
capacitance model is used in the simulation. The capacitance model for this selection 
is simpler than other capacitance models ( capMod = 1 or 2). However, discontinuity in 
both charge and capacitance characteristics exists at the boundary of different operation 
regimes such as from accumulation to depletion and from depletion to strong inversion. 
If Vgs < VFBCV + Vi iS , the device operates in the accumulation region, 

QG — W act ive ■ L active * Coxe (Vos-Vbseff -VFBCV), (6.109) 

Gb = -Gg, (6T10) 

Ginv = 0, (6.111) 

where W ac tive and L actjve are the effective channel width and channel length of the device; 
VFBCV is a user-defined model parameter for the flat-band voltage. 
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If VFBCV + Vbs < Vgs < Vth, the device is in the subthreshold region and the charge 
expression becomes 



Q B — Inactive * ^active * C 0 xe 

Qg — ~ Gb , 

0INV = 0. 



^lox 2 

2 



-I + . 1 + 



4 ■ (V GS - VFBCV — Vbseff) 



K 



lox 



(6.112) 

(6.113) 

(6.114) 



If Vgs > V*, the device is in strong inversion. Similar to the DC case, the device 
operates either in the linear or in the saturation regime, depending on whether the drain- 
to-source bias is lower or higher than the saturation voltage given below: 



Vlsat.cv — 



Vgs — V th 



*bulk,cv 



A 



bulk,cv — ^bulk 



i + l^) 

active / 



CLE 



V th = VFBCV + d> s + K loxV^s - Vbseff, 



(6.115) 

(6.116) 
(6.117) 



where the detailed form of /\ bulk has been given in the discussion of the I-V model; CLC 
and CLE are model parameters. 

When Vqs > Vh, the device operates in strong inversion region. When Vos < Vd sa t,cv, 
one can derive the following equations for the charges at the gate and bulk of the device 
in linear region. 



Qg — lV a ctivc ^active C (IX c 

( 

V GS - VFBCV - <b s 

V 

£2b = ^active ^active Coxe 

/ 

VFBCV - V m - <b s + 

V 



\ 



Vds 



+ 



Ibulk.cv 



Vds 



12 V GS - V, h 



^buik.cv ■ Vds \ 

2 J 



(6.118) 



(l — Abuik.cv) ■ Vds (l — -^bulk,cv) ' -^bulk,cv ' Vds" 



\ 



12- V GS -Vth- 



ibulk. 



,cv • Vds\ 
~2 / 



(6.119) 

The inversion charges can be further partitioned into <2 inv = Qs + Qd- Different 
charge partition schemes, controlled by a model parameter XPART, are adopted to par- 
tition the Q s and Q D into different ratio, 0/100, 50/50, and 40/60 with XPART = 1 (or 
any value larger 0.5), 0.5, and 0 (or any value smaller than 0.5). 
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When XPART = 0 (or <0.5), the 40/60 charge partition scheme is used in modeling 
the channel charge partitioning into drain and source. 



^ „ t T r- V G S _ A bulk cv ^ 

— ^active ' ^active ' '--oxe ^ 2 * 



(VgS _ Vth ) 2 Abulk.cv ■ (Vbs — Vth) (Abulk.cv • Vds) ? 

Abulk,cv ■ Vos ■ g + 55 



V GS - Vth - ■ V DS 



Gs — — (Gg + Gb + Qd)- 



( 6 . 120 ) 

( 6 . 121 ) 



When XPART — 0.5, the 50/50 charge partition scheme is adopted in partitioning the 
channel charge into drain and source. 



Gd — Gs — ^active * /^active * G 0 



Vgs ~ Vih Abuik.c 



(Abulk.cv ■ Vds) 



(6.122) 



~ , (at at Abulk.cv T . \ 

24 ■ ( Vbs - Vth V DS ) 

When XPART = 1 (or >0.5), the 0/100 charge partition scheme is used, and the charge 
at the drain and source can be formulated as 



^ ^ Vgs - Vth , A bu ik,cv T/ 

SdD — ~ W active * ^active * '-oxe * 1 * V DS 



( A-bulk.cv • Vds) 

TA A (at AT Abulk.CV T , 

24 ■ I Vgs - Vth V DS 

Gs = ~(Qg + Gb + Gd)- 



(6.123) 



(6.124) 



When V DS > Vd sa t,cv> the device operates in saturation region. One can derive the 
following equations for the charges at the gate and bulk: 



G G = lV a ctive ^ active Coxe • Vgs “ VFBCV — 4> 



(6.125) 
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2b - W active L active C OX e ■ (vFBCV - V* + <fl s + (1 Abulk ^ } - Vdsat ' cv ^ . (6 . 126) 

Similar to the case in the linear region, the inversion charges can be further partitioned 
into <2 inv = 2 s + 2d- When XPART = 0 (or <0.5), the 40/60 charge partition scheme 
is used in partitioning the channel charge into drain and source. 

2d = _ 4-W active -L active -C oxe (Vgs _ ^ (6 . 127) 

2s = -(2g+ 2b + 2d)- (6.128) 

When XPART — 0.5, the 50/50 charge partition scheme is adopted, 

2d = 2s = -W active ■ L active ' C oxe ■ ^ VgS ~ V(h ^ . (6.129) 

When XPART — 1 (or >0.5), the charge at the drain and source can be formulated as 

2d = 0, (6.130) 

2s = -(2g + 2b)- (6-131) 



(2) Capacitance model option 2 (capMod = 1 ) 

In the capacitance model with capMod = 0, piecewise equations are used in different 
operation regimes, so the model has discontinuities and nonsmooth transitions at the 
boundaries between accumulation and depletion and between subthreshold region and 
strong inversion region. In the capacitance model with capMod = 1, which was the 
capMod = 2 model in BSIM3v3, the discontinuities are removed by introducing smooth 
functions at the boundaries of the operation regimes in deriving the charge/capacitance 
equations. 

While charge and capacitance characteristics of a MOSFET transit between the accu- 
mulation and depletion regimes, a smooth function for the effective flat-band voltage is 
introduced as given in the following: 

^FBeff = Vfbzb — ^ ‘ f ^ fbzb ~ ^ GB ~ 



+ V (Ffb Z b — Vqb — 0.02) 2 + 0.08 ■ Vfb Z b 
Vfbzb = Vth|y BS= v DS= o — ‘bs — K \ ■ v^. 



(6.132) 

(6.133) 



Another smooth function V gs teff,cv> similar to the Vg Ste ff in DC model but with a simpler 
form, is used to ensure the continuous transition from depletion to strong inversion. 



Vgsteff.cv = NOFF ■ n ■ V t ■ In 



1 + exp 



f Vgse ~ Vth ~ VOFFCV \ 
V NOFF ■ n ■ V t ) 



(6.134) 



where NOFF and VOFFCV are model parameters. 
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To smooth out the transition of charge/capacitance between the linear and saturation 
regions, one smooth function, similar to Eq. (6.61) in I-V model, is used with a constant 
value (0.02) for £ 4 , a DELTA-like parameter, 

Vcveff ~ f)lsar.cv ' ^^dsat.cv ^DS *^4 T yj 1 V| sa t.cv V])g ^4)” T 4 ■ <$4 V ( | sa . cv 

(6.135) 

where Vdsat.cv is the saturation voltage given by Eq. (6.115). 

With the considerations of the charge/capacitance model continuities, the capMod — 1 
model can be derived and the following relationships hold: 



Qg — -(Qinv + 6b), 

6b = — (6acc + 6subo + <56sub), 

6aCC — Inactive * ^active * C 0 xe * (l^FBeff Vfbzb), 

K\ox 2 

6sUB 0 = — ^active ' ^active * Coxe * ~ 



^ ^ ^ 4 (Vg Se Vpg e ff Vg S teff,cv ^bseff) \ 

V ^7 / 



^£2 SUB — ^active * ^active * Coxe 

1 -^bulk,cv T/ 

* ^cveff — 

QlNW = ^active * ^active * ^ox 
T cv 



(l -^bulk,cv) * ^bulk,cv * ^cveff 

12 ■ ( Vgs.eff.cv - ■ Vcveff 



-^bulk.cv ■ Vcveff” 



fgsteff.cv 



Vcveff + 



12 ■ V r 



dhulka 



'gsteff.cv 



Vcveff 



(6.136) 

(6.137) 

(6.138) 



(6.139) 



(6.140) 



(6.141) 



When XPART — 0.5, the 50/50 charge partition scheme is used. The charges at the 
source and drain terminal can be described by 



6s = 6 d = 



6 inv 



— IVicrive ‘ T a ctive * Cox * 



V 



^bulka 



gsteff,cv 



Vcveff 



(6.142) 



+ 



-A bulk ,cv" • Vcveff 



12 ■ I V gst eff,cv - ■ Vcveff 
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When XPART = 0 (or <0.5), the 40/60 charge partition scheme is used, 



2 s = 



^active * ^active * C 0 xe 



2 ■ | Vgsteff.cv - ■ V CV eff 



3 i 2 

^gsteff, 

cv 3 ' Vgsteff, cv * -^bulk,cv ■ Vcveff 



4“ ‘ Vgsteff.cv ' (^bulk.cv * V C veff)~ 



15 



(-^bulk.cv * Vcveff) 



(6.143) 

2d — — (Qg + 2b + 2s)- (6.144) 

When XPART = 1 (or >0.5), the 0/100 charge partition scheme is adopted, 



2s = — 



Inactive ‘ L active * C 0 xe 



1 

Vgsteff.cv 4“ ' -4 bulk ,cv ^cveff 



(^bulk ,cv Vcveff) 



12 ■ (Vgsteff.cv ' V cv eff) 



(6.145) 



2d — — ( Qg 4- 0b 4- 2s)- 



(6.146) 



(3) Capacitance model option 3 (capMod =2) 

The capacitance model with capMod — 2 in BSIM4 is compatible with the capacitance 
model with capMod = 3 in BSIM3v3 as shown in Table 6.1. This model accounts for 
the quantization effect by considering finite charge thickness in deriving the charge and 
capacitance equations. Similarly, the following equations hold: 

2g = ~(Qinv + Qb), (6.147) 

2b = — (2 acc 4- 2subo 4- <52sub)- (6.148) 



Table 6.1 BS1M4 capacitance model options 



Capacitance models in BSIM4 


Model details referencing to BSIM3v3 


CapMod = 0 


Copied from CapMod = 0 model in 
BSIM3v3 


CapMod = 1 


Copied from capMod = 2 model in 
BSIM3v3 


CapMod = 2 (default model) 


Intrinsic capacitance model copied 
from capMod = 3 and 
overlap/fringing capacitance model 
copied from capMod = 2 in 
BSIM3v3 
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With the consideration of the finite charge thickness in the channel, the charge in the 
accumulation region can be derived as 



Q ACC — ^active * ^active * Coxeff * Vgbacc (6.149) 

where C oxe ff is the effective gate oxide capacitance given in Eq. (6.1); Vgbacc is a smooth 
function for the effective gate-to-body voltage introduced to ensure the model continuity 
from accumulation to depletion and is given by 

Vfbzb + Vbseff - Vgse ~ 0.02 + J (Vfbzb + Vbseff - Vgse - 0.02) 2 + 0.08 ■ V ftzb . 

(6.150) 

The other charge components for <2 b defined in Eq. (6.148) are with the follow- 
ing forms: 




<2sUB0 — fk ac tive * ^ active ' 6' OX eff 



K 



lox 



^ y ^ ^ y ^ 4 * (^gse ^FBeff ^gsteff,cv Vbseff) ^ 



(6.151) 



^0SUB — ^active ' ^active ' C'oxeff’ 



1 -^bulk,cv T/ (l -4 bulk ,cv) * -^bulk,cv * Ecveff 

» ’ k cve ff 



n i 1 / ^bulk.cv .. 

1— ■ | Vgsteff.cv ^8 ~ ' kcveff 



(6.152) 



where cps is the potential difference between the real surface potential and the 20 b constant 
and is expressed as follows: 



tp s - V t In 



Vgsteff.cv ' ( Vg s teff,cv ~b 2 ■ K \ ox ■ \/2 ' 0/; ) 

MOIN ■ K\ m - V t 2 




(6.153) 



The inversion charge can be expressed as 



0 1 N V — Wactive * ^active * f^oxctl 1 



Vgsteff.cv PS 2 ' ^bulk.cv * V C veff 



+ 



(rfbulk.cv ‘ Vcveff) 



12 V 



Vgsteff.cv <PS 



rfbulk.cv ’ Vcveff 



(6.154) 
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When XPART — 0.5, the channel charge partitioning to the source is formulated as 



.. . . 1 „ ^active ‘ ^active ' Coxeff 

Gs — Gd — ~ GlNV — 



l^gsreft.cv (p<) ^ ^bulk.cv * Icveff 



+ 



(^bulk ,cv ^cveff) 



. ^ i T/ -4bulk,cv * ^cveff \ 

‘ [ ^gsteff.cv (P8 ^ I 



(6.155) 



When XPART = 0 (or <0.5), the channel charge at the source according to the 40/60 
partition scheme is 



Gs = 



^active ' ^active ' ^oxeff 



2 • ( Vc 



'gsteff.cv *PS 



-^bulk.cv ’ l^cveff 



(^gsteff,cv (P&) ^ * (^gsteff,cv tys) 



2 'y 2 * (-4-Kiiiv pv * 1/pvpff) 

■ bulk,cv • Vcveff + ~ • (Vgsteff.cv — <P&) • (^bulk.cv • Vcveff) — 

(6.156) 

Gd = ~(Qg + Gb + Gs )• (6.157) 

When XPART = 1 (or >0.5), the channel charge at the source terminal can be given by 



Gs = - 



^active * ^active * G 0 xeff 



1 

fgsteff. cv T<) ^ bulk.cv ' f'eveff 



(^bulk.cv ' l^cveff) 



12 ■ ^Vgsteff.cv *P8 

Gd = —(Qg + Gb + Gs)- 



^bulk.cv * Fcveff 



(6.158) 

(6.159) 



6.11.2 Fringing/Overlap Capacitance Models 

Fringing capacitance consists of a bias-independent outer fringing capacitance and a 
bias-dependent inner fringing capacitance. Inner fringing capacitance is more complex 
to model. Only the bias-independent outer fringing capacitance is included in BSIM4 by 
introducing a model parameter CF. If CF is not given in the model file, it is calculated by 



CF = 



2 ■ EPSROX ■ so 
n 



• In 



(i+iiin. 

V TOXE ) 



(6.160) 
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The overlap capacitance was modeled with a bias-independent component in previous 
older MOSFETs. It is in the case of the overlap capacitance model that capMod = 0 in 
BSIM4. However, it has been found that the overlap capacitance shows a strong bias 
dependence. Depending on the bias condition between the gate and the source/drain, the 
overlap region between the gate and the source/drain can be in accumulation, or depletion. 
The bias-dependent overlap capacitance was modeled in the capacitance models when 
capMod — 1 and 2. 

When capMod — 0 is given in the model file, the gate-to-source, gate-to-drain, and 
gate-to-bulk overlap charges are given by 



{2 overlap, s — Inactive * CGSO • VqS, 
(J o\ crap.,: — fV ac tive * CGDO • V( P) - 
(?overlap,6 ^active * CGBO ■ V G B- 



(6.161) 

(6.162) 

(6.163) 



When capMod — 1 or 2, in the model file, the gate-to-source charge is given by 



overlaps 

Inactive 



CGSO ■ V GS + CGSL 



Vc,S Vg S) overlap 



CKAPPAS 



-1 + J 1 — 



r Vgsoverlap 

CKAPPAS 



' gs, overlap 



= \ ■ (V„s 



+ 0.02 



'(Vbs + 0.02) 2 + 0.08) 



and the gate-to-drain overlap charge is given by 



overlap, d 



= CGDO ■ Vgd + CGDL 



(6.164) 



(6.165) 



V( ID ^gd, overlap 



CKAPPAD 



— 1 + \ 1 — 



r ’ rgd, overlap 

CKAPPAD 



r gd, overlap 



= \ ■ (Vgd 



+ 0.02 



'(Vgd + 0.02) 2 + 0.08) , 



(6.166) 



(6.167) 



and the gate-to-bulk overlap charge is given by 



overlap, Z t 



= CGBO ■ Vqb 



(6.168) 



Table 6.2 Calculation of the default values of CGSO, CGDO, and CGBO parameters 



Conditions 



If CGSO is not given and DLC > 0 
If CGDO is not given and DLC > 0 
If CGSO is not given and DLC < 0 
If CGDO is not given and DLC < 0 
If CGBO is not given 



Calculation 



CGSO = DLC C oxe - CGSL (= 0 if <0) 
CGDO = DLC Coxe - CGDL (= 0 if <0) 
CGSO = 0.6X7 C oxe 
CGDO = 0.6X7 C 0 xe 
CGBO = 2DWC Coxe 
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where CGSO, CGDO, CGBO, CGSL, CGDL, CKAPPAS , and CKAPPAD are model param- 
eters. The default values for CGSL and CGDL are zero. If CGDO, CGSO, and CGBO are 
not given in the model file, they are calculated by the following expressions in Table 6.2. 



6.12 HIGH-SPEED (NON-QUASI-STATIC) MODEL 

As the circuit clock rate gets faster and faster, the need for an accurate prediction of 
device/circuit behavior near the cutoff frequency or under very rapid transient operations 
increases. In BSIM4, a charge-deficit non-quasi-static (NQS) model, based on the one in 
BSIM3v3 (Chan et al. (1994)) but with many improvements, is included. Model selec- 
tors such as trnqsMod and acnqsMod are introduced to turn on or off the NQS model, 
depending on the applications. The default values for trnqsMod and acnqsMod are zero. 
When trnqsMod = 1 , the charge-deficit NQS model is used in transient simulation. When 
acnqsMod — 1, the charge-deficit NQS model is used in AC simulation. The transient 
and AC NQS models are developed from the same fundamental physics, that is, the 
channel/gate charge response to the external signal is relaxation time (r)-dependent and 
the transcapacitances and transconductances for the AC analysis can be expressed as a 
function of jco r. However, in the model implementation, the AC NQS model does not 
require the internal NQS charge node that is needed in the transient NQS model. 



6.12.1 The Transient NQS Model 

Figure 6.22(a) illustrates how the NQS effects in a MOSFET are modeled in BSIM3v3 
(Chan et al. (1994)) and Figure 6.22(b) gives the RC equivalent circuit of the charge- 
deficit NQS model for transient simulation. An internal node <2def(0 i n addition to the ones 
for a typical four-terminal MOSFET is added to keep track of the amount of deficit/surplus 
channel charge necessary to reach equilibrium. The resistance R is determined from the RC 
time constant (t). The current source t c heq(0 represents the equilibrium channel charging 
effect. The capacitor C with a typical value of 10 9 F is implemented to improve the 
simulation accuracy and the <2 def can be expressed as 

Qdef — Kiel ■ Cfact - (6.169) 

With the consideration of both the transport and the charging component, the total 
current related to the terminals D, G, S can be written as 



Id.g.s — ^d,g,s (DC) + 



9(?d,g.s(0 

dt 



(6.170) 



On the basis of the relaxation time approach, the terminal charge and the corresponding 
charging current are modeled by 



0def(0 — Gcheq(0 QchD) 



(6.171) 
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Conventional 

Equivalent RC network quassi-static models 




(b) 



Figure 6.22 (a) Quasi-static and non-quasi-static models in circuit simulations and (b) equivalent 

circuit for charge-deficit NQS model for transient analysis 



and 

( 6 . 172 ) 

( 6 . 173 ) 

where D, G, ,S' xpar1 are charge-deficit NQS channel charge partitioning numbers for termi- 
nals D, G, and S, respectively; D xpart + .S' xpalt = 1 and G xpart = — 1. 

The transit time r is equal to the product of R u and WeffL e ffC oxe , where R„ is the 
intrinsic-input resistance (HR), including both the drift and the diffusion components of 
the channel conduction, and is given by 



dQdef(t) _ 3<2cheq(0 _ <2def(0 

3 1 3 1 T 

3 Qd,g,s(0 rv o Qdef(t) 

r — U, Lr, Oxpart > 

dt T 
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6.12.2 The AC NQS Model 

For small-signal simulation, by substituting Eq. (6.171) into Eq. (6.173), Q C h(0 can be 
transformed into the following in the frequency domain: 



A0 c h(O = 



A QcheqO) 

1 + j ■ CO ■ T 



(6.175) 



On the basis of the above equation, the transcapacitances Cq i, Csu and Coi (i stands 
for any of the G, D, S, and B terminals of the device) and the channel transconductances 
G ,„ , Gds, and G m b S all become complex quantities. For example, the G m has the form 



and 



G m = 



Cdg = 



T m0 



1 + CO 2 ■ T 2 



Gdgo 

1 + CO 2 ■ T 2 



J ' 



J ' 



CO - T ■ G m 0 ^ 

1 + CO 2 ■ X 2 ) 


(6.176) 


CO ■ T ■ CdG0^\ 
1 + CO 1 ■ T 2 / 


(6.177) 



where G m o and Cdgo are the DC transconductance and transcapacitance at operation 
bias condition. 



6.13 RF MODEL 

6.13.1 Gate Electrode and Intrinsic-input 
Resistance (HR) Model 

It has been known that the gate resistance should be included in a MOSFET model for RF 
applications. BSIM4 provides four options for modeling gate electrode resistance (bias- 
independent) and intrinsic-input resistance (HR, bias-dependent). The HR model can be 
considered as a first-order NQS model. Thus, this model should not be used together 
with the charge-deficit NQS model. A model parameter rgateMod is introduced to select 
different gate resistance models in BSIM4. 

When rgateMod = 0 (zero resistance), no gate resistance is included in the simulation. 
This is the default selection in the model and is the “classic” MOSFET model for digital 
and analog applications for three decades, as shown in Figure 6.23. 

When rgateMod = 1, a resistor with constant resistance is added in the model by intro- 
ducing an internal gate node as shown in Figure 6.24. The component R ge itd is given by 

rshg(xgw+ ^ effcj ) 

R g eitd = i 3-NGCOn) 

8 NGCON ■ (L drawn - XGL) ■ N F 





Figure 6.23 The model without gate resistance (rgateMod = 0) 
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Figure 6.24 Illustration of the model with a constant resistance at the gate (rgateMod = 1) 

where RSHG, NGCON, XGW, and XGL are model parameters for gate sheet resistance, 
numbers of gate contacts, distance from the gate contact to the channel edge, and the 
offset of the gate length due to variation in patterning, respectively. Np is an instance 
parameter for the number of fingers of the devices. 

When rgateMod — 2, an intrinsic-input resistance model with variable resistance is 
used in the simulation. The gate resistance is the sum of the electrode gate resistance 
given by Eq. (6.178) and the intrinsic-input resistance given by Eq. (6.174). Only one 
internal gate node is created as in the case for rgateMod = 1 and no additional node is 
added for the additional bias-dependent gate resistance, as given in Figure 6.25. Since 
this model includes the first-order consideration of the NQS effect, the model selectors 
for NQS models should be turned off by setting trnqsMod = 0 and acnqsMod = 0. 

When rgateMod = 3, the model equations remain the same for gate resistance, but 
in the model implementation, an additional node is added for the intrinsic-input gate 
resistance as shown in Figure 6.26, so the AC current through the overlap capacitance is 




Figure 6.25 The model with both a constant and a bias-dependent gate resistance ( rgateMod = 2) 




Figure 6.26 Model with two additional nodes for the gate resistance components 
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i 




Figure 6.27 The five-resistance substrate network 



parallel to the current component through the instead of passing through it, which is 
the case when rgateMod = 2. Similar to the case when rgateMod = 2, trnqsMod — 0 
and acnqsMod = 0 should be selected in the model file. 



6.13.2 Substrate Resistance Network 

At high frequency, signal coupling through the substrate should be accounted for in the 
circuit simulation. To do that, the substrate components in a MOSFET should be modeled. 
BSIM4 offers a flexible built-in substrate resistance network. A model selector rbodyMod 
is used to turn on this option in the simulation. When rbodyMod — 0, the “classic” 
MOSFET model without substrate resistance network will be used in the simulation. 
When rbodyMod = 1, a five-resistance substrate network, as shown in Figure 6.27, is 
introduced. In the model implementation, a minimum conductance, GBMIN, is created in 
parallel to each resistance component to prevent any infinite resistance values during the 
simulation. Please note that additional nodes, such as sbNode, dbNode, and bNodePrime in 
Figure 6.27, have to be introduced for the model with the five-resistor substrate network. 
In the model implementation, the internal body node bNodePrime has been assumed 
as the reference point for the substrate defined in the intrinsic model. Thus, the impact 
ionization current /„ and the GIDL current /qidl will flow into bNodePrime . 



6.14 NOISE MODEL 

The noise model has been improved in BSIM4 compared with BSIM3v3. The following 
noise sources are modeled in BSIM4, such as flicker noise, channel thermal noise, induced 
gate noise and the correlation with channel thermal noise, the thermal noise due to the 
resistances at the terminals, and the shot noise due to the gate tunneling current. Different 
model selectors have been introduced to use the noise models needed in the simulation, 
as discussed below. 




NOISE MODEL 195 



6.14.1 Flicker Noise Models 



Two flicker noise models are included in BSIM4. A flicker noise model selector fnoiMod is 
introduced to use a specific model in the simulation. When fnoiMod — 0, a simple flicker 
noise model that is convenient for hand calculation and model parameter extraction is used. 
When fnoiMod = 1, a unified physical flicker noise model is used. Both the models come 
from BSIM3v3, but the unified model has been improved in BSIM4 in several aspects 
such as the noise characteristics predicted by the model transit smoothly over different 
bias regions and also the bulk charge effect has been accounted for. The default model, if 
the user does not define any specific noise model in the simulation, is the unified flicker 
noise model. 

When fnoiMod — 0, the spectral drain current noise power density is given by 



SAf) = 



KF ■ I DS AF 
Coxe ' L e ff ■ f FF 



(6.179) 



where KF , AF, and EF are model parameters and / is the operation frequency of 
the device. 

When fnoiMod = 1, the spectral drain current noise power density is formulated as 



SAf) = 



,inv (/) ■ %sub(/) 

Sid ,inv (/) + S id , sub (/) 



(6.180) 



where Si d ,inv(/) and .S' K i,xub(./) are the spectral drain current noise power density of the 
device in the inversion and the subthreshold regions, respectively. 

The spectral drain current noise power density in the inversion region is expressed as 



Sid,inv(/) = 



&B ■ T ■ q 2 ■ / DS ■ Heff 
f E f ■ L t ff 2 ■ C oxe ■ 10 10 L 
NOIC 



, N 0 + N* , 

NOIA ■ In — + NOW ■ (N 0 - N,) 

N, + N* ' 



+ - 



C No 1 - Nr ) 



+ 



Kr -T- I ds z ■ A L 



-'dm 



f R f ■ Leff 2 ■ Wes ■ 10 10 



NOIA + NOIB ■ N, + NOIC ■ N, ~ 

Wi + N *) 1 

(6.181) 

where NOIA, NOIB , and NOIC are parameters; N 0 , A), N*, and AL dm are given as 



N 0 



Ni 



* 



O'oxe ‘ Vg S teff 



Coxe ' Fgsteff / AbQjif ■ Vdseff 

<■ 1 \ fgsteff + 2- V t 

V, ■ (Coxe + C d + CIT) 



(6.182) 

(6.183) 



N : 






(6.184) 
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and 



/ V'ds 



Vdseff + EM ^ 



AL c i m = Lit! ■ In 



Litl 

2 ■ VSAT 



/Off 



(6.185) 



where EM is a model parameter and the expression of Litl has been introduced before. 
The spectral drain current noise power density in the subthreshold regime is given by 



Sid,sub(/) 



NOIA ■ k B ■ T ■ / DS 2 
f E f ■ Lett- W eS -N* 2 - 10 10 ' 



(6.186) 



6.14.2 Channel Thermal Noise Model 

Two options for the channel thermal noise are provided in BSIM4. One is the charge- 
based model from BSIM3v3 and the other is the holistic model. They can be selected by 
a model parameter tnoiMod. 

When tnoiMod = 0, the charge-based thermal noise model is used in the simulation. 
The noise current is given by 



Ki = 



4- K R T ■ A/ 



■ NT NO I 



Rm(V) + 



^eff 



H eft’ \(2u 



(6.187) 



where A’ds ( W) is the bias-dependent LDD source/drain resistance, NTNOI is the model 
parameter introduced to improve the simulation accuracy, especially for short-channel 
devices, and Q inv is modeled by 



Q inv = Wu 



active * ^active ' 



Ooxeff ' 



T7 A bulk ' Tdseff 

Vgsteff — r 



4 bulk ' I'dseff 



12 • y £ 



ibulk ' 



l^dseff 



Vgsteff ' 



(6.188) 

Figure 6.28(a) illustrates the schematic of the thermal noise model with tnoiMod = 0. 
When tnoiMod — 1, the holistic thermal noise model is used. In this thermal noise 
model, all the short-channel effects including the velocity saturation effect incorporated 




Figure 6.28 Schematic of the equivalent circuit of the noise model; (a) tnoiMod = 0 and 
(b) tnoiMod = 1 
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in the I-V model are automatically included in the noise calculation. In addition, the 
amplification of the channel thermal noise through G m and G m b S as well as the induced 
gate noise with partial correlation to the channel thermal noise are all captured in the 
new “noise partition” model. Figure 6.28(b) illustrates the schematic of the thermal noise 
model with tnoiMod = 1, in which part of the channel thermal noise source is partitioned 
to the source side. 

The noise voltage source partitioned to the source side is given by 

T 2 = 4 -k B -T ■ 6 tnoi ■ Vdse ; f - A/ (6.189) 

Ids 



and 



Omni — 0.37 ■ 



1 + TNG IB ■ L eff ■ 



f Meff ' Vgsteff \ 

\2 ■VSAT-L's) 



(6.190) 



The noise current source in the channel region with gate and body amplification is 
given by 



id 



and 



2 _ 



4-k B -T- V dsef f ■ A / 
7ds 



Anoi = 0.577 



[<JDS + Anoi (G m + Gmbs)]" — V<j 2 • ( G m + GdS + G„,bs ) 2 

(6.191) 



1 + TNOIA ■ L eff ■ 



Me ff ‘ Fgstctf 

2 VSAT- L ef f 



(6.192) 



6.14.3 Other Noise Models 

6.14.3.1 Thermal noise models for parasitic resistances 

BSIM4 calculates the thermal noise contribution from the parasitic resistances at the gate, 
the drain, the source, and the substrate. The power spectral density of the noise current 
from the gate resistance is given by 



Su.R, = 



4 • L b ■ T 

R, 



(6.193) 



The power spectral density of the noise current from the drain parasitic resistance is 
given by 

4 • L b • T 

Sit,*, = —£ • (6.194) 

An 



The power spectral density of the noise current from the source parasitic resistance is 
given by 

4 ■ L b ■ T 
SiuRs - ■ 



(6.195) 
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The power spectral density of the noise current from one of the five substrate resistances 
is given by 

4 ■ k B ■ T 

Sn.R^ = ' (6.196) 

^Vsubx 

where 7? SU b X can be RBPS, or RBPD, or RBSB. or RBDB. or RBPB. 

6.14.3.2 Shot noise model for gate tunneling current 

Shot noise contributed from the gate tunneling current is also modeled in BSIM4. On the 
basis of the gate tunneling current model discussed earlier, the shot noise spectral density 
can be calculated by 

5sn,igtx = 2 ■ q ■ Ictx. (6.197) 

where Iqu can be the gate-to-source tunneling current, or the gate-to-drain tunneling 
current, or the gate-to-substrate tunneling current. 

6.15 JUNCTION DIODE MODELS 
6.15.1 Junction Diode I- V Model 

Three junction diode model options are provided in BSIM4. A model parameter dioMod 
is introduced to select a specific diode model in the simulation. When dioMod = 0, a 
resistance-free diode model is referred. In this model, no current limiting feature, which 
is modeled in the diode model when dioMod = 1 and 2, is accounted for. However, 
depending on whether the model parameter XJBVS or XJBVD is given in the model file, 
the junction breakdown feature can be turned on or off in the simulation. The dioMod — 1 
diode model comes from BSIM3v3, in which the current limiting feature was included 
in the forward-bias region by introducing a current limiting parameter IJTHSFWD or 
IJTHDFWD. However, the junction breakdown is not modeled. When dioMod = 2, the 
junction diode model accounts for the diode breakdown with current limiting in both 
forward- and reverse-bias regions. In the following, the diode model equations for the 
source region are discussed. The equations of the diode model in the drain region are 
exactly the same except replacing the subscript “s” for the source region with the subscript 
“d” for the drain region. 

When dioMod = 0, the junction current in the source region can be given by 

/jbs = Abs ■ [exp ( VjBS \ — ll ■ T 1 + XJBVS ■ exp ( BVS + V J BS ^1 (6.198) 
J L V \NJSV t ) J L \ NJS-V t Jj 

where / sB s is the total saturation current from the contributions of both the area component 
and the periphery component, including both the gate-edge and the isolation-edge sidewall 
components, and will be given later after discussing all the model options. 

When dioMod = 1, the junction current in the source region can be given by 




(6.199) 
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and 



TjBS = AtjBSfd — A BS + 



' vjSBfd 



NJS ■ V t 

where Vj BS fd and BS fd are given by 

A’jBSfd = AbS ' ex P 

VjBSfd =NJS- V t • In ( 1 + 



(VjBS — VjBSfd) VjBS > VjBSfd (6.200) 



VjBSfd 

NJS ■ V , ) ’ 

IJTHSFWD \ 
Abs / 



( 6 . 201 ) 

( 6 . 202 ) 



When dioMod — 2, both current limiting and breakdown are included in the model. The 
junction current in the source region is given by 



AbS = 



, XJBVS ■ (VjBS — VjBSrev) ( BVS - V jBSrev 
AjBSrev ~ AbS exp 1 



exp 



Vbs 



7jBS = AbS 



NJS ■ V, , 
VjBS 



- 1 



exp 



- XJBVS ■ exp - 



NJS ■ V, 

BVS+ VjBS 



NJS ■ V t 



VjBS < VjBS rev? 



+ XJBVS ■ exp 



BVS 



NJS ■ V, 



1 



(6.203) 



NJS ■ V t 



Abs — Abs 



Vjbsfd , , 

exp | — + exp 

F ' NJS ■ v t 



NJS ■ V, , 

VjBSrev < VjBS < VjBSfd- 
BVS 



(6.204) 



1 



XJBVS ■ exp 



Vjbsfd 
' NJS- v t 



NJS ■ v, 

BVS+ Vj bsfd 



exp 



NJS ■ v, 



where V jBS rev and AjBSrev are given by 



T , DI/C , NJS ■ V t , (1JTHSREV\ 

VjBSrev - BVS ■ In / rs J . 



A'jbsiev — Abs 



1 + XJBVS ■ exp 



BVS + Vj 



jbs 



NJS ■ v t 



Vjbs — Vjbsfd, 

(6.205) 

(6.206) 
(6.207) 



In the above, IJTHSREV and IJTHSFWD are model parameters introduced to limit 
the current in reverse bias and forward bias. 

In all the diode models above, the saturation current Abs is used and is given in 
the following: 



AbS — ^seff ■ As(T) + Pseff ' Asws <T) + W e ff c j • NF ■ Aswgs(T) (6.208) 

where A scff and P S eff are the effective area and peripheral length of the source region, and 
they will be discussed in detail in the next section. J SS (T), Asws (T), and -Aswgs ( T ) are 
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given by 



J SS (T) = JSS ■ exp 



E s (TNOM ) E g (T) 

V x (TNOM) ~ VJT) 



+ XTIS ■ In 



TNOM 



NJS 



J SSWS (T ) = JSSWS ■ exp 



E g (TNOM) E g (T) 
V { (TNOM ) ~~ Vt(T) 



+ AT/S ■ In 



TNOM 



NJS 



(6.209) 



(6.210) 



and 



/sswgs(r) = TSWG.S' ■ exp 



E g (TNOM) E g (T) 
V t (TNOM) ~ V t (T) 



+ XTIS ■ In 






( T 



TNOM 



NJS 



( 6 . 211 ) 



where E g is the energy band gap of silicon and is calculated by 



EAT) = 1.16 



7.02 ■ 10“ 4 • T 2 
T + 1108 



( 6 . 212 ) 



6.15.2 Junction Diode Capacitance Model 

Source/drain junction capacitances consist of three components, the bottom junction 
capacitance, the sidewall junction capacitance along the junction edge at the isolation 
side, and the sidewall junction capacitance along the junction edge at the gate side. In the 
following, we discuss these capacitance components, taking only the source junction as 
an example. The corresponding model equations for the drain junction can be given by 
replacing the subscripts from “s” to “d”. 

The total junction capacitance for a junction capacitance at the source side can be 
described by the following equations: 



CBS — ^seff ' CjflS + -Pseff ‘ CjBSsw + ^effcj ■ Nf ■ CjBSswg (6.213) 

where CjBs is the unit-area bottom S/B junction capacitance, Cjbssw is the unit-length S/B 
junction sidewall capacitance along the junction edge at the isolation side, CjBSswg is the 
unit-length S/B junction sidewall capacitance along the junction edge at the gate side. 
A se ff and P seff are the model parameters for effective area and periphery length. Np is the 
number of fingers for a multifinger device. 

The parameter Cjbs in the above is calculated by 

/ Vbs \- mjs 

Cjbs = Cjs(r) • f 1 - V BS < 0, (6.214) 

C jBS - CjsCT) ■ ^1 - MJS ■ Vbs > 0. (6.215) 
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The parameter Cjbsws in the above is calculated by 

/ \ HS \- MJSW 

CjBSws = Cjsws(T) ■ ( 1 — — — J ^bs < (6.216) 

Cjbsws = Cjsws(r) • (1 - MJSW ■ 7bs _J Vbs > 0. (6.217) 

V * BSWS (T ) ) 

The parameter CjBSwsg in the above is calculated by 

/ y BS \“™ G 

CjBSwsg — Cjswsg(T’) -ll — —— I V B s < 0, (6.218) 

\ T’bswsg(i ) / 

CjBSwsg = Cjswsg(C) ■ f 1 - MJSWG ■ ) V B s > 0, (6.219) 

\ gbswSGU)/ 

where MJ, MJSW, and MJSWG are model parameters. The temperature dependences of 
the junction capacitances are described by the following: 

Cjs(r) = CJS ■ [1 + TCJ ■ (T - TNOM)\, (6.220) 

Cjsws (T) = CJSWS ■ [1 + TCJSW ■ (T - TNOM )], (6.221) 

Cjswsg(T’) = CJSWGS ■ [1 + TCJSWG ■ ( T - TNOM)]. (6.222) 

The temperature dependences of the built-in potentials on the source side are des- 
cribed by 



P bs (T) = PBS - TPB ■ (: T - TNOM), (6.223) 

/jisws (T) = PBSWS - TPBSW ■ (T — TNOM) , (6.224) 

Pbswgs(T) = PBSWGS - TPBSWG ■ (T - TNOM). (6.225) 



6.16 LAYOUT-DEPENDENT PARASITICS MODEL 

BSIM4 considers the layout geometry -dependent parasitics model, which can calculate 
the parasitic geometry information according to the layouts of the devices, such as isolated, 
shared, or merged source/drain, and multifinger, and so on. Depending on the layout 
difference, many new model parameters have been introduced to select the model for the 
corresponding parasitic geometry calculation. 



6.16.1 Effective Junction Perimeter and Area 

In this section, only the case of the source-side junction is discussed. The same approach 
is used for the drain-side junction. 




202 THE BSIM4 MOSFET MODEL 



If a PS parameter is given in the model file, the effective perimeter of the source 
junction. P se ff, can be calculated by 

P seff = PS perMod = 0, (6.226) 

P seS = PS - Weffej • NF perMod / 0. (6.227) 

Please note that P se ff (not PS) will be used in calculating the junction DC and CV in 
BSIM4 and it refers to the total source junction periphery length subtracted by total gate- 
side periphery. Also, the meaning of PS is different depending on value of the perMod 
parameter in the model file. When perMod — 0, the value of PS does not include the 
periphery length at the gate side and hence P seff = PS. When perMod is not zero, PS 
follows the traditional definition, which includes the contribution of the periphery length 
at the gate side. However, the P se ff parameter subtracts the contribution of the periphery 
length at the gate side by using Eq. (6.227). 

If AS parameter is given in the model file, the effective area of the source junction. 
A se ff, is calculated by 

Aseff = AS. (6.228) 

If PS and AS parameters are not given in the model file, both P se ff and A se ff will be 
calculated by the parameters Np, DWJ. geoMod. DMCG, DMCI, DMDG, DMCGT , and 
MIN. 

It has been a common practice that compact MOSFET models consider only the case of 
a single-finger (isolated) device, when calculating the source/drain geometry parameters. 
In real applications, the layout structures are more complex than that. A more common 
example in analog/RF applications is a multifinger device, in which the source and drain 
share the same junction area to save the total area of the device and reduce the parasitic 
capacitance. Another example is a device with merged junctions, which are widely used 
in digital applications. BSIM4 takes these cases into account, and the calculations of the 
geometry parameters (and hence the parasitics) in source/drain regions are determined by 
the model parameter geoMod as shown in Table 6.3. 



Table 6.3 geoMod parameters and the layout options of the source/drain regions 



geoMod 


End source 


End drain 


Note 


0 


Isolated 


Isolated 


Np must be odd 


1 


Isolated 


Shared 


Np can be odd or even 


2 


Shared or none 


Isolated 


Np can be odd or even 


3 


Shared or none 


Shared or none 


Np can be odd or even 


4 


Isolated 


Merged 


Np must be odd 


5 


Shared or none 


Merged 


Np can be odd or even 


6 


Merged 


Isolated 


Np must be odd 


7 


Merged 


Shared or none 


Np can be odd or even 


8 


Merged 


Merged 


Np must be odd 


9 


Isolated/shared 


Shared or none 


Np must be even 




or none 






10 


Shared or none 


Isolated/shared 


Np must be even 






or none 
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It is too space-consuming to list the detailed equations for the source/drain geometric 
parameters for all the cases of the geoMod. In the following, we give the corresponding 
expressions to calculate A sctf , A def f, P seff , and P de ff for the isolated, merged, and shared 
cases, respectively, on the basis of which the detailed expression for each geoMod case 
can be derived. 

The effective area and periphery parameters of the isolated end source/drain regions 
can be calculated by 



AXgff = NX end ■ ( DMCG + DMCI - 2 ■ DMCGT) ■ Weff.cj 

+ NX i„ t ■ 2 ■ ( DMCG - DMCGT) ■ W ctf , cj , (6.229) 

PX e ff = NX tnd ■ [2 ■ (DMCG + DMCI - 2 ■ DMCGT) + Weff.cj] 

+ NX int ■ 4 ■ (DMCG - DMCGT ) , (6.230) 



where AX t ff can be AS e s or AD e ff, the effective source or drain area; PX e ff can be PS e ff or 
PD t ff, the effective source or drain perimeter; NX tn & can be NS e nd or ND m d, the numbers 
of end source or drain regions; NX mt can be NS\„ t or /VD int , the numbers of the internal 
source or drain regions. DMCG is the distance from the center of source/drain contact to 
the gate edge. DMCI is the distance between the center of the source/drain contact and 
the isolation edge in the channel-length direction. The definitions of DMCG, DMCI , and 
Weff.cj can be found in Figure 6.29. DMCGT is a model parameter to make necessary 
corrections to DMCI and DMCG is a model parameter to fit the measured data from a 
specific test structure. 



^drawn ~ L 




Figure 6.29 Illustration of the definitions of the layout-dependent model parameters 
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The effective area and periphery parameters of the shared end source/drain regions can 
be calculated by 



AZ eff = NX e nd ■ ( DMCG - DMCGT) ■ W eff , cj 

+ NX un ■ 2 ■ ( DMCG - DMCGT) ■ W eff , cj , (6.23 1) 

PX t ff = /VZ end ■ 2 ■ ( DMCG - DMCGT) + NX int ■ 4 ■ ( DMCG - DMCGT). (6.232) 

The effective area and periphery parameters of the merged end source/drain regions 
can be calculated by 

AZeff = AZ end • (DM DC, - DMCGT) ■ lT e ff, CJ 

+ NX mt ■ 2 ■ (DMCG - DMCGT) ■ WTff.cj, (6.233) 

PZ ef f = NX tnd ■ 2 ■ ( DMDG - DMCGT) + NX m ■ 4 ■ (DMCG - DMCGT). (6.234) 



where DMDG is the distance between the center of a merged source/drain junction and 
the gate edge. 

The numbers of Z xen ,i and /¥ xmt depend on the AT and MIN parameters. If AT is odd, 
ATend is T and Z x i nt is determined by ( Ny- I j/2. If Nf is even, the calculation of ATend is 
different depending on the MIN parameter. If MIN — 0, it means that the number of drain 
contacts is to be minimized. In other words, the outer junction regions of the device with 
even finger will be considered as source regions. If MIN — 1, it means that the number 
of source contacts is to be minimized, that is, the outer junction regions of the device 
will be considered as drain regions. The calculation of ND end , NS en( i~ ND int , and AT^nt for 
even number of fingers are given in the following: 



A® end 



Attend 



ND mt 



NS;, 



2 MIN 


= 1 


0 MIN ^ 1 ’ 


0 MIN 


= 1 


2 MIN / 1 ’ 


AT 


MIN = 1 


~2~ ~ 






MIN ^ 1 


~2 


^Vp 


MIN = 1 


~2~ 


AT | 
~2 ~ 


MIN / 1 



(6.235) 

(6.236) 

(6.237) 

(6.238) 



6.16.2 Source/drain Diffusion Resistance Calculation 

The calculation of source/drain diffusion resistance in BSIM4 depends also on the layouts 
discussed above. A model parameter rgeoMod is introduced to define the method calcu- 
lating the source/drain diffusion resistances to account for the fact that the path of current 




LAYOUT-DEPENDENT PARASITICS MODEL 205 




Figure 6.30 Different source/drain resistance components _R w ide. /? P omt, and R mel ge d. depending on 
the device layouts 



flow at the source/drain and hence the source/drain resistance could be different owing 
to different layout structures with specific type of contacts - point, wide, or merged as 
shown in Figure 6.30. 

When rgeoMod = 0, no source/drain diffusion resistance 7? sdif f or 7? ddif f is generated. 

When rgeoMod ^ 0, if the instance parameter NRS or NRD, for the number of source 
or drain squares, is given, the source/drain diffusion resistances ( i? sd iff and /? ddlt f ) are 
calculated with the following simple equations: 

7? sdif f = NRS ■ RSH, (6.239) 

R dm = NRD ■ RSH , (6.240) 



where RSH is the sheet resistance of the source/drain diffusion region. 

When rgeoMod / 0 and the instance parameter NRS or NRD is not given, the source/ 
drain diffusion resistances (i? S diff and A\i d ,tr ) are calculated from model parameters NF, 
DWJ, geoMod, DMCG, DMCI , DMDG with equations that are a little more complex. 
The equations calculating i? w ; de and /? pomt can be given by 



/'"wide — 



/'"point — 



RSH ■ ( DMCG - DMCGT) 

VTetfcj 

RSH ■ Weffcj 

3 ■ ( DMCG + DMCI - 2DMCGT) ' 



(6.241) 

(6.242) 



BSIM4 assumes all interjunctions to be shared junctions with wide contacts. So 
Eq. (6.241) will be used in most of the cases to calculate the source/drain diffusion 
resistance. The point contact is only used at an end junction. 

The diffusion resistance of a merged junction (no contacts) can be calculated with the 
following equation: 



R 



merged — 



RSH ■ (DMDG - DMCGT) 

Weffcj 



(6.243) 
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Table 6.4 Options for source/drain contacts selected with rgeoMod 



rgeoMod 


Source end contact 


Drain enc contact 


Intercontact 


0 


No R s 


No R d 


Wide 


1 


Wide 


Wide 


- 


2 


Wide 


Point 


Wide 


3 


Point 


Wide 


Wide 


4 


Point 


Point 


Wide 


5 


Wide 


Merged 


Wide 


6 


Point 


Merged 


Wide 


7 


Merged 


Wide 


Wide 


8 


Merged 


Point 


Wide 



RgeoMod is the model selector to determine which of the above equations is to be 
used to calculate the diffusion resistance, depending on whether the contact is a wide one 
or a point one or a merged one, as defined in Table 6.4. 

Owing to the various options given above, the model implementation of the source/drain 
diffusion resistance becomes sort of complex to cover all the cases. This makes the BSIM4 
source/drain series resistance model a little more difficult to use. In some cases, a sub- 
circuit model with user-defined parasitic components in the intrinsic device model is 
implemented by bypassing the complex calculations. 
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7 

The EKV Model 



7.1 INTRODUCTION 

The EKV model was first published in the original work by Enz, Rrummenacher, and 
Vittoz (Enz et al. 1995). The name of the model is constructed from the first letter of 
each of the authors’ last name. Since the introduction, the model has become popu- 
lar worldwide owing to its simplicity and the relatively few parameters involved. The 
model has been implemented in many commercial circuit simulators such as Star-Hspice 
( www.avanticorp.com ), Eldo ( www.mentor.com ), and AIM-Spice ( www.aimspice.com ). 
Visit the EKV Web site at http://legwww.epfl.cli/ek\’/ for further information on the avail- 
ability of the model in other simulators. 

In this chapter, we first present a list of model features to give you a quick glimpse 
of the capabilities of this model. Then, we discuss the backbone of the EKV model that 
consists of the expressions for the drain current and terminal charges of long-channel 
MOSFETs. Then extensions to the basic model are discussed. The model equations and 
parameters presented in this chapter are based on version 2.6 of the model. At the time 
of writing, version 3.0 of the EKV model was being developed. Unfortunately, not all 
equations of this new major release had been fixed at that time. Therefore, we are not 
able to present the model here. However, at the end of this chapter we list new features 
known at the time of writing (October 2002). 

7.2 MODEL FEATURES 

As mentioned in the introduction, the EKV model has gained in popularity owing to 
its simplicity. In addition, the EKV model has many other attractive features, which are 
listed below. 

• Uses the bulk node as the reference for all voltages. 

• Symmetric forward/reverse operation of the MOSFET. 

• Modeling of weak and moderate inversion to provide accurate predictions of low- 
voltage, low-current designs. 

• Geometry- and process-related dependencies. 

• Effects of doping profiles. 
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• Single-expression model that preserves continuity of all derivatives for improved con- 
vergence in circuit simulations. 

• High-field, short-channel, and narrow width effects including mobility degradation, 
velocity saturation, channel-length modulation (CLM), source and drain charge-sharing, 
and reverse short-channel effect (RSCE). 

• Modeling of substrate current due to impact ionization. 

• Charge-based dynamic model including a first-order non-quasi-static (NQS) description 
of the transadmittances. 

• Thermal and flicker noise modeling. 

• Estimation of short-distance geometry- and bias-dependent device mismatch properties. 

• Emphasis on accurate modeling of the inversion coefficient in all regions of inversion. 
This feature is important when the circuit designer is using the gm/Ip, design concept. 

• Consistent normalization of device characteristics, in particular for drain current. 

Many of the features listed above are discussed in the following sections. 



7.3 LONG-CHANNEL DRAIN CURRENT MODEL 

The other MOSFET models discussed in this book use the source node as the reference 
node. The EKV model has adopted a different approach and has assigned the bulk node as 
the reference node. The argument by the authors in Enz et al. (1995) is that the bulk node 
is chosen as the reference node to exploit the intrinsic symmetry of the MOSFETs. In 
Figure 7.1 an n -channel MOSFET is shown with definitions of the independent voltages 
and the direction of the drain current la- 

The top level expression for the drain current is 

I d = I F ~lR (7-1) 



where 7 F and 7 R are called the forward and reverse currents, respectively. The expression 
of 7p and 7 R is, in its simplest form, as follows: 



'F(R) 



/ Vp — Vsb(db) \ 
= 7 S In 2 1 + e 2v th 



V 



(7.2) 



G 




Figure 7.1 Voltage definitions used in the EKV model 




LONG-CHANNEL DRAIN CURRENT MODEL 211 



This is an interpolating function 1 that tends to a square law for 7 f<r) Is (corresponds 
to strong inversion) and to an exponential for 7p(R) <<C Is (corresponds to weak inversion). 
Hence, the model describes the weak, moderate, and strong inversion regimes of operation 
with a single expression. In (7.2), 7 S is the specific current, V r > is the pinch-off voltage, 
and V* is the thermal voltage. We should mention here that in the implementation of 
the EKV model in computer programs, the semiempirical interpolation used in the orig- 
inal EKV model is replaced with a fully physical charge-based expression (see Bucher 
et al. (1997)). 

The expressions for 7 S and Vp are given below. 



Is = 2npvl 


(7.3) 


»= kp t- 


(7.4) 




(7.5) 


Vgb — Vgb — Vxo + 0 + Y \/0 > 


(7.6) 


1,-1+ * 

2V^p + 0 + 4Vth 


(7.7) 



Here, n is the slope factor, KP is the transconductance parameter, W and L are the 
effective gate width and length, respectively, 0 is the bulk Fermi potential, y is the body 
effect factor, and y T0 is the nominal threshold voltage at zero bias. The quantity V gb 
in (7.6) is called the effective gate voltage. The basic set of model parameters required 
for the long-channel drain current model is listed in Table 7.1. 

Optionally, instead of specifying the parameter KP, the oxide thickness TOX and 
the low-field mobility UO can be specified and KP will then be calculated. From 
TOX, the oxide capacitance C ox can be calculated as C ox = t: 0 f ; ,/TOX, where e 0 (= 
8.85418 x 10 12 F/m ) is the permittivity in vacuum and S\ is the dielectric constant of 
the oxide material. Strictly speaking, the only parameter that must be specified for the 
charge/transcapacitance model, as well as thermal noise model, is TOX (or C ox ). 

Note that since all voltages in the EKV model are referred to the bulk node of the 
transistor, the model parameter value specified for V T() is also referred to the bulk node. 



Table 7.1 Long-channel drain current model parameters 



Model parameter 


SPICE name 


Description 


Default value 


Unit 


Y 


GAMMA 


Body effect parameter 


1 


v 1 / 2 


KP 


KP 


Transconductance 

parameter 


50 x 10~ 6 


A/V 2 


0 


PHI 


Bulk Fermi potential 


0.7 


V 


Vto 


VTO 


Nominal threshold 
voltage 


0.5 


V 



1 This is not the default interpolation function in the EKV model implementation in AIM-Spice. To use this 
function, model parameter EKVINT has to be specified with a value different from zero. 
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7.4 MODELING SECOND-ORDER EFFECTS 
OF THE DRAIN CURRENT 

Second-order effects of the drain current in MOS transistors include high-field effects 
and short- and narrow-channel effects. In the EKV model, the following second-order 
effects are included: velocity saturation, mobility degradation, channel-length modulation, 
charge-sharing, and reverse short-channel effect. In this section we will discuss how the 
core long-channel drain current model is modified to take into account these effects. 



7.4.1 Velocity Saturation and Channel-length Modulation 



The effects of velocity saturation and CLM are taken into account in the EKV model 
by replacing the length L in the expression for ft in (7.4) with an equivalent length L eq 
as follows: 



ft = KP 



W 



-eq 



(7.8) 



where L e q is given by 



L tq — L AT T 



Vds + V, 



ip 



A L = XL c In 1 + 



^Cl'it 

V ds - V ip 



L( E c 



Lr = 



Vdsx — 



^O^si 
Cox J 

v db - Vsb 



V ip = 7 v d 2 ss + AV 2 - V(V dsx - V DSS ) 2 + AV 2 , 
AV = 4V th 



N 



. , h_ _ Vdss 

I ' 64 



1 

+ T7 ! 



Vdss — Vc 



N 

Vq — E CI i t L. 



I i Yi i h. _ I 

4 \^VT s ~2 



(7.9) 

(7.10) 

(7.11) 

(7.12) 

(7.13) 

(7.14) 

(7.15) 

(7.16) 



Most of the parameters used in the equations above are model parameters and are listed 
in Table 7.2. In addition. £ s ; is the relative permittivity of silicon (=11.7). 

In the equation for L eq in (7.9), the second term on the right side represents a reduction 
in the effective channel length due to CLM, while the last term on the right side represents 
an increase in the effective channel length due to velocity saturation. This approach of 
modeling the effect of velocity saturation is based on the work by Arora et a 1. (1994). 
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Table 7.2 Parameters used for modeling second-order effects in the drain current 



Model parameter 


SPICE name 


Description 


Default value 


Unit 


Cox 


COX 


Oxide capacitance 


7 x 10“ 4 


F/m 2 


Ccrit 


UCRIT 


Longitudinal critical 
electric field 


2 x 10 6 


V/m 


A 


LAMBDA 


Channel-length 

modulation coefficient 


0.5 


— 


Lv 


LETA 


Coefficient for 

short-channel effect 


0.1 


- 


Lk 


LK 


Characteristic length for 
RSCE 


0.29 x 10' 6 


m 


Qo 


QO 


Peak charge density for 
RSCE 


0 


C/m 2 


9 


THETA 


Mobility reduction 
coefficient 


0 


v _1 


Wrj 


WETA 


Coefficient for 

narrow-channel effect 


0.25 


- 


x j 


XJ 


Junction depth 


0.1 x 10“ 6 


m 



7.4.2 Mobility Degradation due to Vertical Electric Field 



There are two models available in EKV for taking into account the reduction in the mobil- 
ity of carriers caused by the vertical electric held - one simple and one more advanced. 
Only the simple model is discussed here. For a description of the advanced model, we 
refer to the EKV user manual by Bucher et al. (1997). If the model parameter EO is not 
specified, the simple model is used. In the simple model the equation for ft is modified 
as follows: 



P = KP ■ 



W 1 
I7 q 1 + eVp’ 



(7.17) 



where 9 is a model parameter called the mobility reduction coefficient. 



7.4.3 Effects of Charge-sharing 

At short gate lengths, the depletion widths associated with the source -channel and 
drain-channel junctions may represent a significant fraction of the total gate length. 
This depletion charge is shared with the depletion charges of the contact junctions and 
causes a reduction in the total depletion charge. The macroscopic manifestation of this 
effect is a reduction in the effective threshold voltage as the gate length is decreased. In 
addition, since the depletion widths are voltage-dependent, the effective threshold voltage 
also becomes dependent on the drain-source voltage. This effect is called drain-induced 
barrier lowering (DIBL) in many texts. 

The effects of charge-sharing are included in the EKV model by introducing an effective 
body effect parameter y' as follows: 



Y = Y 



_£si 

Co 




VsB + 




3 W n 
W 



yvp+0 



(7.18) 
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Here, L rj and W n are the short-channel-effect and the narrow-channel-effect coefficients, 
respectively. They are both model parameters and are listed in Table 7.2. The effective 
body effect parameter y' is now used wherever y appears in the equations above. Notice 
that y' decreases with decreasing L and increasing V S b and Vdb as is expected from the 
discussion above. 

For analog circuit designers, the most important implication of charge-sharing is that 
the output resistance of the transistors drops drastically. This is illustrated in the SPICE 
example presented in Section 7.5. 



7,4.4 Reverse Short-channel Effect (RSCE) 

The reverse short-channel effect is caused by nonuniform doping along the channel. The 
effect is that the threshold voltage is larger at the edges of the channel than in the middle. 
Hence, as the channel length is decreased, the effective threshold is increased. In the EKV 
model, the RSCE is taken into account by adding an extra term in the expression for the 
effective gate voltage V^ b . The modified version of (7.6) then becomes 



where 



Vg b = fgb — Vto + Vrsce + (p + K\/0 



(7.19) 



ErscE = 



2Q 0 



C 0 



1 + 2 (? + + C t J 



« = c,(> o A-‘), 

C A = 0.028; C s = 4(22 x 1(T 3 ) 2 . 



(7.20) 



(7.21) 

(7.22) 



In the equations above, Lk and Qo are model parameters and are listed in Table 7.2. 

With the discussion of the reverse short-channel effect, we end this section on second- 
order effects and the drain current model. The EKV model also includes another contribu- 
tion to the drain current, which is the impact ionization current. However, the equations 
implemented in the EKV model are the same as the equations discussed in Chapter 1 and 
are not repeated here. 

Before we continue with the dynamic model, we take a break from the listing of 
equations and present a SPICE example of how second-order effects affect important 
small-signal characteristics of the transistors. 



7.5 SPICE EXAMPLE: THE EFFECT 
OF CHARGE-SHARING 

As mentioned in Section 7.4.3, the most important implication of charge- sharing for ana- 
log circuit designers is that the output resistance of the transistors drops drastically. In 
this SPICE example we will use the AIM-Spice implementation of the EKV model to 
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The effect of charge -sharing 

* Circuit description 

ml d g 0 0 mn 1=0. 5u w=10u 
vgs g 0 dc 0.7 
vds dd 0 dc 2.5 
vid dd d dc 0 

* Simulation command 
.dc vds 0 5 0.01 
.plot dc i (vid) 

* MOS model definition 

.model mn nmos level=23 cox=3.45m xj=0.15u 
+ vto=0.6 gamma=0.71 phi=0.97 kp=150u e0=88e6 
+ ucrit=4.5e6 dl=-0.05u dw=-0.02u lambda=0.23 
+ ibn=1.0 iba=200e6 ibb=350e6 
+ weta=0.05 q0=280u lk=0.5u rsh=510 
+ leta=0 . 28 

Figure 7.2 The SPICE netlist used in the charge-sharing example 



illustrate how pronounced this effect can be. The SPICE input file used in this example 
is shown in Figure 7.2. 

The model parameters are the same as those published on the EKV home page. This 
parameter set is said to reflect a typical 0.5-p.m CMOS process. We use a minimum 
length transistor supported by this parameter set that is 0.5 |im. As we note from the 
value of the vto parameter, the nominal threshold voltage of this process is 0.6 V. Fur- 
thermore, we intend to bias the transistor only 100 mV above the nominal threshold 
voltage. We have chosen to use a low gate voltage because the effect of charge-sharing 
is less for high gate voltage overdrives. Hence, we have specified a gate-source voltage 
of 0.7 V. 

We want to run a DC simulation sweeping the drain-source voltage from zero to 
5 V. The corresponding simulator command is shown in the line starting with . dc in 
Figure 7.2. In the following line we request that the simulator plot the current through 
the voltage source vid, which is used as an ampere meter to measure the drain current 
of the transistor. In many SPICE simulators, including AIM-Spice, the drain current can 
be plotted directly without having to insert an extra voltage source. However, we have 
chosen to use the syntax and capabilities of standard Berkeley SPICE. 

Well, let us take a look at the results. In Figure 7.3 we have plotted the simu- 
lated drain current. The thick and thin line represent the drain current with the charge- 
sharing effect enabled and disabled, respectively. To disable the charge-sharing effect, the 
model parameters let a and wet a are set to zero in Figure 7.2. 

We note from Figure 7.3 that there is quite a large difference in the two curves. In 
the saturation region, both the current level and the slope are quite different. Since we 
are most interested in the output resistance, we also plot the inverse of the slope of the 
two curves in Figure 7.3. This is shown in Figure 7.4, where again the thick and thin line 
correspond to enabled and disabled charge-sharing effects, respectively. 

Notice that in the saturation region, the output resistance is reduced by a factor of 
about 10 because of charge-sharing. In many amplifier topologies this means the same 
factor of reduction in the gain. 
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Drain-source voltage (V) 



Figure 7.3 Simulated drain current of the circuit shown in Figure 7.2 with charge-sharing enabled 
(thick line) and disabled (thin line) 




Figure 7.4 Simulated output resistance of the circuit shown in Figure 7.2 with charge-sharing 
enabled (thick line) and disabled (thin line) 



7.6 MODELING OF CHARGE STORAGE EFFECTS 

The equations listed in the previous sections are only good for modeling the DC character- 
istics of a MOS transistor. To be able to predict the dynamic performance of MOSFETs, 
a model for the capacitances or stored charges is also required. In the EKV model, the 
effects of charge storage within the intrinsic MOS transistor are implemented using a 
quasi-static charge-based approach with a first-order extension to NQS operation in AC 
analysis. The charge storage model is based on equations for the charge stored at each 
of the four terminals of the transistor. By forcing the sum of the terminal charges to be 
equal to zero, charge conservation is assured. 
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The starting point for the charge storage model is the slope factor for charge calculations 
, which is given by 



n 



q 



1 + 



y 

2%/Vp + cj) 



(7.23) 



To split the total inversion charge between the source and the drain terminals, empirical 
fitting is done as follows: 



x f = 




X r 




(7.24) 

(7.25) 



The normalized drain and source terminal charges q D and q s are now given by 



4 d = — «q 



4s — ^q 



/ 4 3x r 3 + 6x r 3 Xf + 4x r x 2 + 2x 3 
\ 15 (x f + x r ) 2 

/ 4 3x 3 + 6x 3 x r + 4xfX r 2 + 2x 3 
V 15 (x f + x r ) 2 




The total normalized inversion charge is the sum of q\> and qs: 



( 4 + x f x r + x 2 

4i — 4 d + 4s — — ”q t ; 

Xf -E x r 




(7.26) 

(7.27) 



(7.28) 



The bulk charge is given by 



4b 



-yVW+1) — 

Vth 



V', 



GB 



v th 




for V'gb > 0 
for V' GB < 0 



(7.29) 



The conservation of charge is assured by forcing the normalized gate charge to be the 
negative sum of the other charges: 



4g = — 4i — 4 b- 



(7.30) 



The total terminal charges can now be written in terms of their normalized counterparts 
as follows: 



8b — lwc 0X v M b , 


(7.31) 


<2d = LWC ox V t hqr > , 


(7.32) 


Q g = 7-4TC 0X Vth4G> 


(7.33) 


Qi — LVTC 0X Vth4i, 


(7.34) 


Qs - LWC m V th q s . 


(7.35) 
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The intrinsic capacitances can now be obtained from the expressions of the terminal 
charges as follows: 

9 Gy 

C xy = ± — (7.36) 

8Vy 

where x, y = B, D, G, S and the positive sign is chosen when x — y, otherwise the 
negative sign is used. Taking the different derivatives, the result is simple, analytical, 
and continuous expressions for all transcapacitances in terms of Xf, x r , the pinch-off 
voltage and the slope factor, and derivatives thereof, from weak to strong inversion and 
nonsaturation to saturation. A simpler model using the five intrinsic capacitances can 
be obtained when neglecting the slight dependence on the slope factor, resulting in the 
following simple expressions: 



c gs = - C ox LW 



Cgd - -C m LW 



1 - 



2 1 1 

X v ~ X r + 2 Xt 

(X r + Xf ) 2 



1 - 



2 1 1 

Xf - Xf + -X r 

(X r + Xf ) 2 



C gb = C ox LW 



n n - 1 



1 - 



Cgs + Cgd 

CoxLW 



Csb — (ttq l)Cg S , 

Cdb — ' (ttq 1 ) Cgd . 



(7.37) 



(7.38) 



(7.39) 

(7.40) 

(7.41) 



The EKV model also includes extrinsic capacitances. However, the equations implemented 
in the EKV model are the same as the equations discussed in Chapter 1 and are not 
repeated here. 



7.7 NON-QUASI-STATIC MODELING 

As discussed in Chapter 1, for very high frequencies, comparable to the inverse carrier 
transport time of the channel, the MOSFET models should consider the distributed nature 
of the channel charge. The EKV model includes a first-order NQS model for small-signal 
AC analysis. The expression of the NQS drain current is obtained from the quasi-static 
value of the drain current, which is then first-order low-pass filtered according to 



h iCQ = 



A I quasi— static 

1 + NQS ■ ~s ■ t ' 



(7.42) 



Here, NQS is a model parameter allowing the NQS model to be disabled and r is a 
characteristic time constant given by 



4 Xf + 3xfX r + x 2 

z — Tq - 



15 (xf + x r ) 3 



(7.43) 
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where r 0 is the intrinsic time constant, which is given by 

L 2 

To = , 

2/r V 



(7.44) 



where fi is the mobility of the carriers. 



7.8 THE NOISE MODEL 

In the EKV model, the inherent noise of MOS transistors is modeled by a current source 
INDS between intrinsic source and drain. It is composed of a thermal noise component 
and a flicker noise component and has the following power spectral density (PSD): 

‘S'[ X [)S — ^'thermal T“ ‘^'llicken (7.45) 

where the thermal noise component is given by 



^thermal = T fi \ q\ | . (7 .46) 

Here, /q 5 is the Boltzmann constant, T is the absolute temperature in Kelvin, and c/\ is 
the normalized inversion charge given by (7.28). Note that (7.46) is valid in all regimes 
of operation including below threshold and for small V^s . 

The flicker noise PSD is given by 



^flicker 



KF-g m 

WLC 0 xf AF ’ 



(7.47) 



where KF and AF are model parameters and / is the frequency of operation. 



7.9 TEMPERATURE EFFECTS 



The modeling of temperature effects as well as default values may be implementation- 
dependent. Therefore, please consult the documentation of your simulator. 

The temperature appears explicitly in exponential terms through the thermal voltage, 
which is given by 




(7.48) 






where q is the electron charge. The temperature dependency of the band gap of silicon 
is modeled according to 



T 2 

E„ = 1.16 - 0.000702 . (7.49) 

g T + 1108 

The nominal threshold voltage Vto is modeled as decreasing linearly with the temperature 
as follows: 



Vto (T) = Vto - TCV(T - T ref ), 



(7.50) 




220 THE EKV MODEL 



where TCV is a model parameter and T rcf is the reference temperature at which model 
parameters are extracted. Other relevant model parameters are made temperature-dependent 
by the following equations: 

( rji \ BEX 

— J , (7.51) 

/ J X UCEX 

^crit(r) = £cri, ( — J , ( 7 . 52 ) 

<P(T) — 3y lh In ( -7—^ — + E g (T). (7.53) 

tref \ t ref / Aef 

Here, BEX and UCEX are model parameters. 

7.10 VERSION 3.0 OF THE EKV MODEL 

As mentioned in the introduction of this chapter, a new major release of the EKV model 
is right around the corner. This upcoming release has been labeled version 3.0 and pro- 
vides many improvements compared to version 2.6 that was described earlier. Several 
papers have been published that present the different new features available. These papers 
are Sallese et al. (2000), Sallese el al. (2000b), Bucher el al. (2001), Porret et al. (2001), 
Sallese et al. (2001), Bucher et al. (2002), Bucher et al. (2002a), Bucher et al. (2002b), 
Enz (2002), Enz et al. (2002b), and Martin et al. (2002). The improvements to version 3.0 
presented in these papers are as follows: 

• Extension of the charge-based model so that all of its aspects from static to dynamic 
operation, including NQS modeling as well as noise, are handled within the same 
coherent framework. 

• Inclusion of important aspects in very deep submicron CMOS related to the use of 
high doping in the channel region and the polysilicon gate. This leads to effects such 
as poly-gate depletion and quantization effects in the channel. 

• Description of effects caused by nonuniformities both in the vertical and the longitu- 
dinal directions. 

• Effects that are a result of nonideal field distributions in short- and narrow-channel 
devices such as, for example, DIBL. 

• Modeling of vertical field -dependent mobility and velocity saturation effects are 
entirely handled within the framework of the charge model. 

• Verification of the model for accurate description of high-frequency and RF operation 
of MOS transistors. 

According to the authors of the new version, the release is scheduled for early 2003. 
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8 

Other MOSFET Models 



8.1 INTRODUCTION 

In this chapter, we describe two other MOSFET models that are used by design engineers. 
The first one is the MOS Model 9 that was developed by Philips engineers and became a 
standard among many designers and foundries in Europe by the mid-nineties. The second 
model presented is the MOSA1 model developed by two of the authors of this book 
(Fjeldly and Ytterdal) together with Michael Shur. Both models were developed in the 
early 1990s. 

The objective of this chapter is to provide a reference of the two models for the circuit 
designers to use in a design situation. Thus, a detailed discussion of the model equations 
and background physics are not given. 



8.2 MOS MODEL 9 

The MOS Model 9 (MM9) was developed on the basis of the ideas presented in the book 
by deGraaff and Klaassen (1990) and was first published in Velghe et al. (1994). 

The main characteristics of this model that ensure proper operations in the analog 
domain are the following: 

• The consistency of current and charge descriptions by using the same carrier-density 
and electrical-field expressions. 

• Accurate prediction of the transition from weak to strong inversion. 

• Accurate prediction of the transition from linear to saturation region. 

• Continuity in all charges and current expressions and their derivatives. 

• A reduced parameter set to describe an individual transistor. 

The model has been implemented in many commercial circuit simulators such as Star- 
Hspice ( http://www.avanticorp.com/ ) and Eldo ( http://www.mentor.com/ ). Visit the MM9 
Web site at http://www-us2.semiconductors.philips.com/Philips_ModeIs/ for further infor- 
mation on the availability of the model. 



Device Modeling for Analog and RF CMOS Circuit Design. T. Ytterdal, Y. Cheng and T. A. Fjeldly 
© 2003 John Wiley & Sons, Ltd ISBN: 0-471-49869-6 





224 OTHER MOSFET MODELS 



Very recently, in June 2002, a new MOS model 11 (MM 11) was released by Philips. 
The new release is a successor of MM9 and was especially developed to give not only an 
accurate description of currents and charges and their first-order derivatives (i.e., transcon- 
ductance, conductance, capacitances) but also of the higher-order derivatives to be able 
to provide accurate description of electrical distortion behavior as discussed in Chapter 5 
of this book. However, since the model has not yet gained widespread implementation 
in commercially available circuit simulators, we have decided to discuss the MM9 here, 
more specifically, version 903 of MM9. 



8.2.1 The Drain Current Model 

The basic model parameters of the drain-source DC current model in MM9 are listed in 
Table 8.1. 



Table 8.1 Basic model parameters of the MM9 model 



Parameter 


Description 


Unit 


A1 


Factor for the weak-avalanche current 


_ 


A2 


Exponent for the weak-avalanche current 


V 


A3 


Factor of the drain-source voltage above which the weak 
avalanche occurs 


— 


ALP 


Channel-length modulation parameter 


- 


BET 


Gain factor for the reference transistor 


A/V 2 


COL 


Gate overlap capacitance per unit width 


F/m 


ETADS 


Substrate bias dependency of the static feedback effect 


- 


ETAGAM 


Substrate bias dependence of DIBL 


- 


ETAM 


Exponent of the back-bias dependence of m 


- 


GAM1 


Coefficient for the drain-induced threshold shift for large gate 
drive 


y(l-»7Ds) 


GAMOO 


DIBL coefficient 


- 


K 


High-back-bias body factor 


yi/2 


KO 


Low-back-bias body factor 


yi/2 


M 


Number of devices in parallel 


- 


MO 


Parameter for the subthreshold ideality factor model 


- 


NFA 


First coefficient of the flicker noise 


l/(Vm 4 ) 


NFB 


Second coefficient of the flicker noise 


l/(Vm 2 ) 


NFC 


Third coefficient of the flicker noise 


1/V 


NT 


Coefficient of the thermal noise 


J 


PHIB 


Surface potential for strong inversion 


V 


THE1 


Coefficient of the mobility reduction due to the gate-induced field 


1/V 


THE2 


Coefficient of the mobility reduction due to the back-bias 


1/V 1 / 2 


THE3 


Coefficient of the mobility reduction due to the lateral field 


1/V 


TOX 


Gate oxide thickness 


m 


VP 


Characteristic voltage of the channel-length modulation 


V 


VSBT 


Limiting voltage of the V* dependence of m and y 0 


V 


VSBX 


Transition voltage for the dual-k-factor model 


V 


VTO 


Nominal threshold voltage with no substrate or drain bias 


V 


ZET1 


Weak inversion correction factor 


- 



Note'. DIBL; drain-induced barrier lowering. 
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Before we start listing the expressions that build up the drain-source current model in 
MM9, we should briefly discuss the geometry scaling principle applied in the model. Take 
a look at the description of the fifth parameter in Table 8.1. It states that the parameter 
BET is the gain factor of the reference transistor. What is the reference transistor? In 
MM9 the reference transistor is usually a long and wide square transistor and all the 
model parameters listed in Table 8.1 are extracted on the basis of measurements of this 
transistor. Therefore, in many implementations of the MM9 model in circuit simulators, all 
the model parameter names include an “R” at the end to indicate that the model parameter 
is for the reference transistor. For example, the parameter THE1 is called THEIR. In this 
chapter, the effective length and width of the reference transistor is denoted as L e ff >re f 
and VTeff.ref, respectively. When the expressions are applied to an actual transistor, most 
of the model parameters are modified using geometry scaling expressions. This feature is 
discussed in Section 8.2.2. 

In the MM9 model the top level drain-source current expression is given by 

n+8i\ , 

VgoVdst — l-z- K 2 st 

7 ds = /3G 3 4 . (8.1) 

{1 + fltVgtl + e 2 (u s - M S0 )}(1 + 03 Vdsl) 

Here, the terms in the denominator make up the mobility model that includes mobility 
reduction due to the vertical field (the term containing 0 \ ), the bulk-source voltage (the 
term containing 0 2 ), and the lateral field (the term containing 0 3 ). In the MM9 model the 
source and drain parasitic resistances are absorbed into the theta values as follows: 



#i _ 0 1X + ( R s + R ( j)/3, 



0 3 = 0 3T — R d /3 — 



THE1 

2 



( 8 . 2 ) 

(8.3) 



Here, R d and R s are the drain and source parasitic series resistances, respectively. Fur- 
thermore, 6]j and 6*37 are the temperature- and geometry-updated versions of the model 
parameters THE1 and THE3, respectively (see (8.48) and (8.50)). 

The effects of channel-length modulation and drain-induced barrier lowering (DIBL) 
are take into account by the MM9 model through the term G 3 in (8.1). This term consists 
of an interpolation expression, which interpolates between the subthreshold and the strong 
inversion regimes of operation by the following expressions: 

G 3 = + (8 . 4) 

cr 1 + g 1 

G 2 = I +«1n(l + VdS ~ f VdSl )> (8.5) 

and 

Gi = e v 8 t2/(2mV ' th> , (8.6) 



where V ds is the drain-source voltage and V* is the thermal voltage. 




226 OTHER MOSFET MODELS 



To assure the continuity of the higher-order derivatives at the saturation voltage, the 
following smoothing function is used for Kisi : 



Km 

hyp 5 (x, x 0 , £) 



hyp,(x,£) 



Kiss 1 



£3 



Si 



V g t3 

Vgta 



hyp 5 (Vds, Kdssi, £ 3 ), 



x 0 - hyp, 





+ \/x 2 + 4 e , 
2 



1 + 5 ! 



1 + 




20 3 V gt3 ’ 
1+Si 



0.3 



Kiss I 

1 + Kdssl 



0.3/ (ko-k)VL \ 

Us V Kbx + (0.1 Kg,, + Kb) 2 / ’ 



l2mV , th ln(l + Gi), Vg t 2 < Vg ta 

1 Vg t2 + 10- 8 V gt2 > Vgta ’ 



2/nV lh ■ 37. 



(8.7) 

( 8 . 8 ) 

(8.9) 

( 8 . 10 ) 

(8.11) 

(8.12) 

(8.13) 

(8.14) 



In (8.13), m is the subthreshold ideality factor, which ideally is 1.0. However, as discussed 
in Chapter 1, in real CMOS technologies, the capacitance of the depletion region under 
the channel makes the ideality factor larger than 1.0. In the MM9 model the sensitivity 
of the ideality factor on the source-bulk voltage is governed by 



/ 7 / \ W 

l U s o \ 

m — 1 + m 0 I — I , 


(8.15) 


V«sl / 




«s0 = y/ffai, 


(8.16) 


u s i = hyp 2 (m s , Mst, 0.01), 


(8.17) 


hyp 2 (x, x 0 , e) = x 0 - hyp,(x - x 0 , e), 


(8.18) 


Us — \J h ] , 


(8.19) 


Mst = VKb, + 0B' 


(8.20) 


hi - hyp,(y sb + 0.50b, 0.01) + 0.50b. 


(8.21) 



The threshold voltage model of the MM9 model takes into account all important effects 
present in modern MOS transistors, which include substrate bias effects, DIBL, and 
static feedback. 

From the expressions listed above, we see that the MM9 model defines two gate 
voltage overdrives, Vg t i and Vg t 2 (Vgta is not really a gate voltage overdrive). It comes as 
no surprise that also two threshold voltages are involved. The first gate voltage overdrive 
is defined as 



V g ,i =hy Pl (V gs - V T i,5- 10- 4 ), 



(8.22) 
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where the threshold voltage in this case is given by 



Vti = Vto + A Vto. 



(8.23) 



Here, 



A Vto — k 



+ 



k \ k 

liyPi ( V s b . V sda . 0.1) T- I - u sx I - u s 

, ko ) ko 



ko J a/* l -hyP4(Vsb, VsbA-,0.1) — «sol , 



(8.24) 



hyp 4 (x , x 0 , s) = hyPi(x - x 0 , e) - hyp^-.ro, e), 



— \! Vsb.v + 1 



(8.25) 

(8.26) 



From (8.23) and (8.24) we note that this first version of the threshold voltage contains 
only the effect of substrate bias. 

The second gate voltage overdrive V gt 2 uses a second threshold voltage that contains 
all the different effects listed above. V gt 2 is defined as follows: 



V gt2 = V gs - V T2 , 
Vj2 = Vti + A Vti, 



V 2 , 

AT? .. T 7 .. S 11 T^ETADS 

A Vti - “To, 2 V* - y t 2 , 77i V ds 



V 2 

v gtx 



Vi + V-, 

gt-i ' v gti 



^ + nit 



sfl 

n = -7T. 



Vo = Too 



UslY 

«s0/ 



(8.27) 

(8.28) 

(8.29) 

(8.30) 

(8.31) 



The total drain current is the sum of 7 ds , the weak-avalanche current, and the leakage 
current of the drain-bulk diode. In most implementations of the MM9 model in circuit 
simulators, a dedicated model called JUNCAP (Velghe 1995) is available for describing 
the bulk-drain and bulk-source diodes. This model is not discussed here. The weak- 
avalanche current / dba in MM9 is given by 



JO, V ds <V dsa 

l / ds a 1 e 1_a2/(Vds_Vdsa)1 , V ds > V dsa ’ 



(8.32) 



where V dsa = a 3 V dssl . 

8.2.2 Temperature and Geometry Dependencies 

The expressions for the effective channel length and width used in the MM9 model are 



T ctt — T drawn + LVAR — 2 ■ LAP 



(8.33) 




228 OTHER MOSFET MODELS 
and 

Weff - Wdrawn + WVAR - 2 ■ WOT, (8.34) 

where L drawn and Wd raw n are gate length and width used on the layout. Furthermore, 
LVAR, LAP, WVAR, and WOT are model parameters that are listed in Table 8.2. 

Table 8.2 Model parameters for geometry and temperature dependencies 



Parameter 


Description 


Unit 


ETAALP 


Coefficient of the length dependence of a 


- 


ETABET 


Exponent of the temperature dependence of the 
gain factor 


_ 


ETAZET 


Exponent of the length dependence of G 


- 


LAP 


Lateral diffusion on each side of the gate 


m 


LVAR 


Difference between the actual and the drawn gate 
length 


m 


SLA1 


Coefficient of the length dependence of a\ 


m 


SLA2 


Coefficient of the length dependence of a 2 


Vm 


SLA3 


Coefficient of the length dependence of a 2 


m 


SLALP 


Coefficient of the length dependence of a 


m w 


SLGAM1 


Coefficient of the length dependence of y\ 


y(l->7 Ds) m 


SLGAMOO 


Coefficient of the length dependence of y 00 


m 1 ' 2 


SLK 


Coefficient of the length dependence of k 


V'/ 2 m 


SLKO 


Coefficient of the length dependence of k 0 


V'/ 2 m 


SLMO 


Coefficient of the length dependence of mo 


m 


SLTHE1 


Coefficient of the length dependence of 9\ 


m/V 


SLTHE2 


Coefficient of the length dependence of 9 2 


m/(V) 1/2 


SLTHE3 


Coefficient of the length dependence of 9 2 


m/V 


SLVBST 


Coefficient of the length dependence of P sbt 


Vm 


SLVBSX 


Coefficient of the length dependence of P sta 


Vm 


SLVTO 


Coefficient of the length dependence of Vt 0 


Vm 


SLVT02 


Second coefficient of the length dependence of Vj 0 


Vm 2 


SLZET1 


Coefficient of the length dependence of G 


m" f 


STA1 


Coefficient of the temperature dependence of a i 


1/K 


STLTHE1 


Coefficient of the temperature dependence of the 
length dependence of 9i 


m/(VK) 


STLTHE2 


Coefficient of the temperature dependence of the 
length dependence of 9 2 


m/(VF 2 K) 


STLTHE3 


Coefficient of the temperature dependence of the 
length dependence of 9 2 


m/(VK) 


STMO 


Coefficient of the temperature dependence of m 0 


1/K 


STTHE1 


Coefficient of the temperature dependence of 6\ 


1/(VK) 


STTHE2 


Coefficient of the temperature dependence of 0 2 


1/(VF 2 K) 


STTHE3 


Coefficient of the temperature dependence of Oj, 


1/(VK) 


STVTO 


Coefficient of the temperature dependence of Vj 0 


V/K 


SWA1 


Coefficient of the width dependence of a i 


m 


SWA2 


Coefficient of the width dependence of a 2 


Vm 


SWA3 


Coefficient of the width dependence of a 2 


m 


SWALP 


Coefficient of the width dependence of a 


m 
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Table 8.2 ( continued ) 



Parameter 


Description 


Unit 


SWGAM1 


Coefficient of the width dependence of yi 


y(l->tas) m 


SWK 


Coefficient of the width dependence of k 


V 1 / 2 m 


SWKO 


Coefficient of the width dependence of k 0 


V 1 / 2 m 


SWTHE1 


Coefficient of the width dependence of 6 i 


m/V 


SWTHE2 


Coefficient of the width dependence of 62 


m /(V) 1 / 2 


SWTHE3 


Coefficient of the width dependence of 63 


m/V 


SWVSBT 


Coefficient of the width dependence of V sbt 


Vm 


SWVSBX 


Coefficient of the width dependence of V s b^ 


Vm 


SWVTO 


Coefficient of the width dependence of V T0 


Vm 


WOT 


Channel-length reduction per side due to the 
lateral diffusion of the source/drain dopant ions 


m 


WVAR 


Difference between the actual and the drawn gate 
width 


m 



These effective quantities are then used together with model parameters to include 
geometry and temperature dependencies of the model parameters as follows: 






BET 



W eff 



-eff 




ETABET 



(8.35) 



where ETABET is a model parameter and T aam is the temperature at which the model 
parameters were extracted. All model parameters related to geometry and temperature 
dependency are listed in Table 8.2. 

The surface potential is updated versus temperature as follows: 



0 B - PHIB + 



PHIB- 1.13-2.5- 10- 4 T nom 
300 



(T - T nom ). 



(8.36) 



The threshold voltage at zero-bias Vfo depends both on geometry and temperature accord- 
ing to the following expression: 



Vjo = VTO + STVTO(7’ - T nom ) + SLVTO 



-eff 



eff, ref 



+ SLVT02 






2 

eff 



T 2 

^ eff, ref , 



+ SWVTO 



u- 



i 



V W e ff Weff re f 



(8.37) 



The channel-length modulation effect and the weak inversion correction parameter are 
made dependent on the geometry by applying the following expressions: 



a = ALP + SLALP ( 

L e ff 

6 eff, ref 



1 



1 






ETAALP 

eff 



t ETAALP 
^ eff, ref , 



+ SWALP 



1 



1 



(. W e ff W e ff ref ) 



(8.38) 



V P = VP 



(8.39) 
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and 

£1 = ZET1 + SLZET1 ( ETAZET — Tetazet ) • (8.40) 

\ L eff ^eff.ref / 

The length dependency of the DIBL parameter yoo is quadratic and is given by 

yoo = GAMOO + SLGAMOO ( 4 - — ) (8-41) 

\ ^eff ^eff.ref / 

The subthreshold slope parameter MO has both temperature and length dependencies that 
are modeled as follows: 



m 0 = MO ■ STMO ■ (T — r nom ) + SLMO 





(8.42) 



The body effect parameters in the threshold voltage model depend on the geometry of 
the actual devices as follows: 

k 0 = KO + SLKO (— - — + SWKO ( — — V (8.43) 

Ueff ieff.ref/ V Weff ^eff.ref / 

k = K+SLk( — — )+SWK( — — V (8.44) 

Ueff L e ff,ref/ V Weff W cffjef J 

and 

V sbl . = VSBX+ SLVSBX (— — ) + SWVBSX (— — ) . (8.45) 

V T e ff L eSM J Weff Weff.ref/ 

Also, the drain-bias dependency of the threshold voltage varies with the geometry of 
the device. The dependency is described by the following two expressions: 



y sbt = VSBT + SLVSBT ( — — ) (8.46) 

\ A e If iefffref / 

and 

yi = GAM1 + SLGAM1 (— — ) + SWGAM1 ( — — ) . (8.47) 

V T e ff Leif ref / Weff Weff, ref / 

The parameters of the mobility model are updated according to the operating temper- 

ature and geometry as follows: 



9 it = THE1 + STTHE1 (T - T nom ) + [SLTHE1 + STLTI IE 1 (T - T nom )] 

■ | — — ) + SWTHE1 ( — — ) , (8.48) 

V^e ff ieff.ref/ V W et f W effjef ) 

9 2 = THE2 + STTHE2(T - 7; om ) + [SLTHE2 + STLTHE2(T - T nom )] 

■(— — ) + SWTHE2 ( — — ) , (8.49) 

Ueff ieff.ref/ V Weff W eff , re f/ 
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<9 3T = THE3 + STTHE3 CT - r nom ) + [SLTHE3 + STLTHE3(r - 7" nom )J 
■(— - — ) + SWTHE3 ( — — \ . 

V C e ff L e ff re f / V VVcff IVeff.ref / 



The parameters for the weak-avalanche current model is updated versus temperature and 
geometry according to 



= A1 + STA1(T - T nom ) + SLA1 ( — — ) + SWA1 (— — ) , 

Ueff Kffref/ V W eff Weff.ref/ 



ct2 — A2 T - SLA2 (— — ) + SWA2 ( — — 

\L e ff Ceff.ref / \ W e ff Weff ref 



/ 1 1 
a 3 = A3 + SLA3 



+ SWA3 



Kt’f Kff.ref / V^eff Weff re f 



1 1 



8.2.3 The Intrinsic Charge Storage Model 

The charge storage model for the intrinsic capacitances in MM9 is charge-based to guaran- 
tee charge conservation. The expressions for the terminal charges in an n -channel transistor 
are as follows (to obtain the expressions for /^-channel devices change the signs): 



2d = -C« [1 V, 3 + A2 - vw (^F, + ^ - j). • 

es = -c„[ly s0 + A2.v«(lF,-i i ^-i) / 



Gs = -C 0X y- v gt3 + A2 ■ v ds2 

Gb = j(Gbs + Gbd), 

Gg = — (Gd + Gs + Gb), 



Cox hyP3(Vgb — v FB , v ds2 + v sb + Kid — v F b, o.03), v gb < Vfb 



Gbd = 



k 2 

ko + + hyp 3 (V g b — Vfb. 

2 \ V ds2 + Vsb + Vxi d -y F B, 0.03) 



K b > K 



gb ^ Ffb 



■ Cox hyp 3 (Vg b - V FB , Kb + Vri - V FB , 0.03), V gb < V FB 



0BS — C ox An -| + i + hyP3(Vgb “ VfB ' , v gb > Vfb. 
“ \ Kb + Vri- Vfb, 0.03) 
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(8.60) 

(8.61) 

(8.62) 

(8.63) 

(8.64) 

(8.65) 

(8.66) 

(8.67) 



+ ko ly/h 2 - hyp 4 (V dh , V sbx , 0.1) - w s0 } , 


(8.68) 


^2 = hypjCVdb + 0.5 </>b, 0.01) + O.50b> 


(8.69) 


r. l / dss2 

s 7 = 0.1 . 

1 + Vdss2 


(8.70) 


From the expressions for the terminal charges listed above, the intrinsic capacitances can 
be obtained by taking the corresponding derivatives of the terminal charges with respect 
to the voltages. If one is interested in the Meyer capacitances (see Section 1.4.4), they 


can be calculated as follows: 




r 9Q o 

Dgs_i — „ .. 

0V z s Wgb 


(8.71) 


9Qq 

Lgd -‘ — a,/ 

0VgS tgs, Vgb 


(8.72) 


and 

r 9 Qg 

Cgd - 1 _ ai/ 

dVgb Tgs, V gd 


(8.73) 
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Here, we have used the subscript “i” to indicate intrinsic capacitances. To obtain the total 
capacitances we have to include the overlap capacitances as shown below. 

Cgs = C gs _ i + WeffM ■ COL. (8.74) 

C gd = C gd _i + W eS M ■ COL. (8.75) 

Capacitances related to the bulk-drain and bulk-source p/i-junctions are also part of the 
equivalent circuit of MM9. In most implementations of MM9 in circuit simulators, a 
dedicated model called JUNCAP (see Velghe 1995) is available for describing the bulk- 
drain and bulk-source capacitances. We refer to this document for a description of junction 
capacitances. 



8.2.4 The Noise Model 

The total noise output of a transistor consists of a thermal and a flicker noise part that 
create fluctuations in the channel current. Because of the capacitive coupling between 
gate and channel regions, current fluctuations are also induced in the gate current. These 
two aspects are covered in MM9 by assigning two correlated noise-current sources, one 
connected between drain and source / nt |, and the other one between gate and source / ng , 
and an uncorrelated noise-current source between drain and source /' nfl . The correlated 
current sources are directional. This is illustrated in Figure 8.1. 

The expressions for the different noise sources in the MM9 model are now presented. 
The contribution from the thermal noise has the following spectral density function: 

5, nlh = N T h 6 , (8.76) 



where 




Figure 8.1 Noise-current sources inserted into the small-signal equivalent circuit of a MOS 
transistor 
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*5 

Il4 

h 



Vdssl 

2V*’ 



1 + 03 Vdsl + j ^1 > 



Vc 



g*3 



1 + Si 



Vis: 



{1 + 01 Vgtl + 02 (w s — M s o)}(l + 03 Vdsl) 
(l + 3l)(l+03Vdsl)Vdsl 
2V gt3 -(l + 50 Vdsl 



( 8 . 79 ) 

( 8 . 80 ) 

( 8 . 81 ) 

( 8 . 82 ) 



The spectral density function used to model the effects of flicker noise is given as 



Su = 






S si + 



( 8 . 83 ) 



which is a parallel combination of the noise spectral density in weak inversion ,S W i and 
in strong inversion 5 s i. The noise spectral density due to flicker noise in weak inversion 
is written as 



Swi — Nfa 



Vth/j 

fN " 2 ’ 



( 8 . 84 ) 



where 



lV e ff . ref A off, ref NFA 
IVeffLeff M 



( 8 . 85 ) 



and 



N" 



£ox 

gTOX 



Vth(m 0 + 1). 



( 8 . 86 ) 



In strong inversion, the flicker noise spectral density is described by 

s = 1 VthAl ■ TOX 2 

/ e ox{l + ^1 Vgtl + 02 (m s — t^so) } 

N 0 + N" 1 22 ' 

■ Af FA ln — + Nfr(No — N l ) + -Nfc(Nq — NO 

Vu,/^ G2 — 1 | (Vfa + A^fbA^l + A^fcA^l 

~1 G ~2 1 (N l + NO 2 )’ 

where 

lV e ff, re fL e ff re f NFB 

lVeff A eff M 

lVeff,ref Aeff.ref NFC 
WeffLeff M~’ 

S 0 X 

q TOX Vgt3 ’ 



Nfb = 
A^fc = 
N 0 = 



( 8 . 87 ) 



( 8 . 88 ) 

( 8 . 89 ) 

( 8 . 90 ) 




THE M0SA1 MODEL 235 



and 



Nl = 



:(Vg t3 - Vdsl)- 



q ■ TOX 

The spectral density of the final noise source i n \ g in Figure 8.1 is expressed as 

(2nf Cox) 2 1 



(8.91) 



Si . = Nj- 

3 gn 



1 + 0.075 C^lSl 

V 8m 



2 ’ 



(8.92) 



8.3 THE MOSA1 MODEL 

The MOSA1 model is based on our unified charge control model (UCCM) and the uni- 
versal FET model (see, for example, Shur et al. (1992); Lee etal. (1993); Fjeldly et al. 
(1998)). These models exhibit improved accuracy over the basic MOSFET models in 
several areas that will be explained later. Most importantly, both subthreshold and above- 
threshold currents are accounted for by using UCCM for the mobile channel charge 
density. The model also accounts for velocity saturation in the channel, the channel- 
length modulation (CLM), the threshold voltage shift due to drain bias-induced lowering 
of the injection barrier between the source and the channel (DIBL), the effects of bulk 
charge, the bias-dependent average low-field mobility, and self-heating. Another impor- 
tant feature of the model is the utilization of a single continuous expression for the drain 
current, which is valid in all regimes of operation, effectively removing discontinuities in 
all derivatives, thereby improving the convergence properties of the model. The model 
parameters, such as the average low-field mobility, the saturation velocity, the source and 
drain resistances, are extractable from experimental data using a direct extraction method 
described in Shur et al. (1992). 

Clearly, the subthreshold current is very important since it has consequences for the 
bias and logic levels needed to achieve a satisfactory off-state in digital operations. Flence, 
it affects the power dissipation in logic circuits. Likewise, the holding time in dynamic 
memory circuits is controlled by the magnitude of the subthreshold current. 

To correctly model the subthreshold operation of MOSFETs, we need a charge control 
model for this regime. Also, in order to avoid convergence problems when using the 
model in circuit simulators, it is preferable to use a UCCM that covers both the above- 
and below-threshold regimes in one continuous expression. One such model is the unified 
charge control model (UCCM), which is described below (see also Section 1.5). 



8.3.1 The Unified Charge Control Model 

The standard Metal Insulator Semiconductor (MIS) charge control model described in 
Chapter 1 postulates that the interface inversion charge of electrons, qn s , is proportional 
to the applied voltage swing, y gt - v gs — V T , where Vg S is the gate-source voltage and 
Vt is the threshold voltage. This model is an adequate description of the strong inversion 
regime of the MIS capacitor, but fails for voltages near and below Vt (i.e., in the weak 
inversion and depletion regime). Byun et al. (1992) proposed a new UCCM, which was 
applied to HFETs, MOSFETs, polysilicon thin film transistors (TFTs), and other FET 
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devices (see also Moon et al. (1990); Moon et al. (1991); Park et a 1. (1991); Park (1992)). 
According to this model, the electronic inversion charge in the FET channel is related to 
the applied voltage swing as follows: 



Vgt - aV F = q Vth hi + «(«s ~ n Q ). (8.93) 

Here V F is the quasi-Fermi potential measured relative to the Fermi potential at the source 
side of the channel, a is the bulk effect parameter, n 0 is the sheet density of carriers at 
threshold, rj is the ideality factor in the subthreshold regime, V* is the thermal voltage, 
and a ~ q /c t , Cj = s, /d\ is the effective gate capacitance per unit area, £j is the dielectric 
permeability of the gate insulator, and d\ is the thickness of the gate insulator (including 
a quantum correction to the effective insulator thickness). 

We note that in strong inversion, the linear term will dominate on the right-hand side 
of (8.93) and the drift current will dominate the charge transport in the channel. Hence, 
according to the theory of charge transport, V F can usually be replaced by the channel 
potential V. Therefore, the UCCM reduces to a form of the simple parallel plate capacitor 
model above threshold. 

In order to understand the origin of the bulk effect parameter a. we consider the 
expression for the above-threshold inversion sheet charge density in a MOSFET given by 

qn s (x) = Ci[V G s - Vtx - V(x)l (8.94) 



where, assuming a constant substrate doping A a , 



Vjx — Vfb + 2 <pb + ^2s % qN a [2(p b + V (x) — VbsI/O- (8.95) 

Here Vtx may be regarded as a generalization of the threshold voltage Vj of (1.18) in 
Chapter 1, where the dependence of the depletion charge density on the local channel 
potential V is included. Linearizing (8.95) with respect to V, we obtain V TX ~ Ft + 
(a — l)V where a — 1 + \-^/2B s qN a /(2(pt — Fbs)/ci. Hence, the strong inversion charge 
control equation, including the effect of the variation in the depletion charge along the 
channel, can be written as qn s ~ cTVgt ~ aV). In practical modeling, a may be regarded 
as an adjustable parameter with a value close to unity. 

Below threshold, the linear term on the right-hand side of (8.93) can be neglected. 
This allows us to estimate the subthreshold drain-source current as follows in terms of 
the gradient of the Fermi potential: 



dV F 

^ds = qn.Wfi,,—— 
ax 



q q WD „ d« s 
a dx 



(8.96) 



Here we used quasi-equilibrium electron statistics that relates the electron density to the 
Fermi potential. We also used the Einstein relation. £)„ = //„ F th , to relate the mobility to 
the diffusion coefficient. The derivative of n s with respect to lateral position x in (8.96) 
indicates that the subthreshold current is a pure diffusion current. However, for short 
channels, the lateral variation of the channel potential may be significant, as indicated in 
Figure 1.23, giving rise to a drift component in the transport mechanism. Moreover, the 
injection of charge carriers at the source and drain contacts is dominated by thermionic 
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emission. In fact, for very short channels, the latter may be the dominant overall transport 
mechanism, similar to what takes place in a Shottky barrier. 

Finally, we note that an approximate, analytical solution of the UCCM equation (8.93) 
can be obtained from that of the MOS capacitor in (1.38) by replacing Vgt by Vgt — olV f, 



that is, 




(8.97) 



8.3.2 Unified MOSFET I -V Model 

The basic analytical models discussed in Section 1 .4 need improvements in several areas, 
especially for application to modern MOSFET technology. This issue is addressed in the 
unified drain current model presented here. Most importantly, the subthreshold regime is 
considered in terms of the unified charge control modeling approach. Furthermore, the 
universal model also accounts for series drain and source resistances, velocity satura- 
tion in the channel, gate-bias dependence of mobility, impact ionization, channel-length 
modulation (CLM), and for the threshold voltage shift caused by DIBL. In order to 
avoid convergence problems and excessive computation time in circuit simulations, con- 
tinuous (unified) expressions are used for the current -voltage and capacitance-voltage 
characteristics, covering all regimes of operation. 

Another important feature of the model is the use of relatively few, physically based 
parameters, all of which are extractable from experimental data. Details of the parameter 
extraction procedures for the unified MOSFET model are found in Lee et at. (1993). 

In order to establish a continuous expression for the I — V characteristics, valid in all 
regimes of operation, we start by expressing the drain current in each regime and then 
make smooth transitions between them. Moreover, we want the model to be extrinsic, that 
is, to include the effects of the parasitic source and drain resistances, R s and R d (note that 
the corresponding intrinsic model is recovered simply by setting all parasitic resistances 
to zero). These resistances can be accounted for by relating the intrinsic gate-source 
and drain- source voltages, Vgs and Vgd, considered so far to their extrinsic (measured) 
counterparts, V gs and V gd , by including the voltage drops across the series resistances: 

V GS = V gs - R s I d , (8.98) 

V DS = V ds - (R s + R d )! d . (8.99) 



Note that by applying (8.98) in the saturation current expression (1.54) for the simple 
velocity saturation model and by solving with respect to 7 sat , we obtain the extrin- 
sic expression 



1 + pRsV , gt + yi + 2 /D? s V gt + (V gt /V L ) 2 



( 8 . 100 ) 



where V'gt = Vg S — Vj is the extrinsic gate voltage overdrive and ft — W 11,,8,/d, L is 
the transconductance parameter. The extrinsic saturation voltage V sat can likewise be 
expressed as 

V sa t — V gt + ( R d — 




( 8 . 101 ) 
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Similar analytical expressions can also be obtained for the velocity saturation model based 
on other velocity-field expressions such as (1.52) by Sodini (for « -channel MOSFETs) 
and (1.53) with m = 1 (for /^-channel MOSFETs). 

For drain-source voltages in the linear regime, well below the saturation voltage, the 
drain-source current can be written as 

fds % gchiEns ~ gchVds, (8.102) 

where g C hi and y c |, are the intrinsic and the extrinsic channel conductances in the linear 
region, respectively. The linear conductances are related by 



<?ch — 



<?chi 

1 + <gchi(^s + Rd) 



(8.103) 



In deriving (8.103), we assumed that the drain current is so small that the transconductance 
is much less than the channel conductance. The linear intrinsic channel conductance can 
be written as, using (8.97) for n s and setting Vf = 0, 



,?chi — 



qn s W fj. n 

L 



2 Wq^niip 

L 



In 



1 

1 + - exp 




(8.104) 



We note that near threshold and in the subthreshold regime, the channel resistance will 
normally be much larger than the parasitic source and drain resistances, allowing us to 
make the approximations V ds ~ F DS and V gs ~ V G s, and g ch ~ g chi . 

The next task is to bridge the transition between the linear and the saturation regimes 
with one single expression. For this purpose, we have proposed the following extrinsic, 
universal interpolation expression (Shur el al. 1992): 



&chVds(l + ^^ds) 

[l + (gchV ds // sa tr] 1/m ' 



(8.105) 



Flere, m is a parameter that determines the shape of the characteristics in the knee region. 
The factor (1 + kV ds ) is used for describing the finite output conductance in saturation, 
mainly caused by CLM, that is, the effect of the finite extent of the saturated region of 
the channel (see Figure 1.12). We note that (8.105) has the correct asymptotic behavior in 
the linear regime, in agreement with (8.102). In addition, when X = 0, / d asymptotically 
approaches / sat in saturation, as required. 

In Figure 8.2, we illustrate the link between a typical experimental MOSFET I-V 
characteristic and the saturation voltage 7 sat , the linear extrinsic channel conductance y C h, 
and the finite output conductance in saturation g C h S . When CLM is the cause of the latter, 
we have g C hs ~ k/ sat in deep saturation. This result can be used to determine X as part 
of the device characterization. A more detailed discussion of the effect of channel-length 
modulation is given in Section 1.5.2. 

In the subthreshold regime, we can calculate the saturation current from (8.96), mak- 
ing use of the subthreshold approximation of the UCCM expression, that is, n s ^ n 0 
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Figure 8.2 Typical experimental FET I-V characteristic with finite output conductance in 
saturation 



exp[(Vg t — ocVys)/i)V th]- Straightforward integration over the channel length, correspond- 
ing to a variation in Vp from 0 to Vd S , gives the following idealized expression for the 
subthreshold current: 



/ 



sub 



qn 0 qD„W 

aL 



exp 




1 — exp 




(8.106) 



From this expression, the subthreshold saturation current is obtained for Vds > 2 V t h, where 
the term in the square bracket approaches unity, causing the dependence on Vds to vanish. 
However, as indicated earlier, the inclusion of short-channel effects in the model will 
restore some degree of drain-bias dependence in the saturated subthreshold regime. 

A remaining task is now to join the above-threshold and the subthreshold current 
expressions into one unified model expression. We first note that in strong inversion 
we can write /lV gt = £ chi since ft — W ii n CJL and qn s — CiV gt at small V ds . Hence, all 
occurrences of the combination f J > V gt in (8.100) can be substituted by gd,;. Next, we 
replace all remaining occurrences of V gt by the effective gate voltage overdrive V gte that 
coincides with Vgt well above threshold and equals 2Vth below threshold. This converts 
(8.100) to the following unified form that approaches the correct limiting behavior above 
and below threshold: 



<?chi Vgte 

1 + gchi^s + + 2g c h!.R s + (V gte / Vl) 2 



(8.107) 



Here, V gte is the effective gate voltage overdrive, which can 
Section 1.5.3). 



Vgte 



Vth 



1 + 



X$_ 

2V lh 



+ 




be written as (see 



(8.108) 



where S determines the width of the transition region at threshold (V gt = 0). Typically, 
S — 3 is a good choice. 
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Figure 8.3 Experimental (symbols) and modeled (a) above-threshold and (b) subthreshold 
characteristics for a deep submicrometer «MOS with effective gate length L = 0.09 p,m. 
Device parameters: W = 1 p,m, <f; = 3.5nm, /x„ = 0.026 m 2 / Vs, v s = 6 x 10 4 m/s, m = 2.2, 
ft s = .R d = 200fl, A. = 0.142 /Y, r] = 1.7, Fro = 0.335V, cr 0 = 0.11, V a =0.2V, V at = 0.18V. 
Substrate-source bias F b s = 0 V. Reproduced from Ytterdal T., Shur M.. and Fjeldly T. A. (1994) 
Sub-0.1 |im MOSFET modeling and circuit simulation. Electron. Lett., 30, 1545, 1546 and Mii Y. 
etal. (1994) Experimental high performance sub-0.1 |xm channel ;?MOSFET’s, IEEE Electron 
Devices Lett., EDL-15, 28-30 
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Hence, using (8.107) for the saturation current and (8.103) for the channel conductance, 
the I-V expression of (8.105) becomes truly unified, with the correct limiting behavior 
both above and below threshold, and with a continuous transition between all regimes 
of operation. Additional nonideal effects related to short channels and high fields, to be 
discussed next, can now be added to this model. 

The quality of the present universal MOSFET model is illustrated in Figure 8.3 for 
a deep submicrometer «-channel MOSFET. Figure 8.3(a) shows the above-threshold 
I-V characteristics and Figure 8.3(b) shows the subthreshold transfer characteristics in 
a semilogarithmic plot. As can be seen from the figures, our model reproduces quite 
accurately the experimental data in the entire range of bias voltages, over several decades 
of current variation. The parameters used in the model calculations were obtained from the 
parameter extraction procedure described by Shur et al. (1992) (see also Lee et al. (1993)). 



8.3.3 Unified C-V Model 

The charge storage model used in the MOSA1 model is the unified Meyer C-V model 

described in Section 1.5. 3.1 and is not further discussed here. 
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Bipolar Transistors in CMOS 
Technologies 



9.1 INTRODUCTION 

As most analog designers have experienced, bipolar junction transistors (BJTs) can be 
realized in CMOS technologies. In most cases the bipolar action is unwanted and causes 
problems for the designers. This is why the BJTs in CMOS technologies are often referred 
to as parasitic devices. However, in some analog functions, the exponential relationship 
between the emitter current and the base-emitter voltage of bipolar transistors is needed. 
Probably the most important applications are PTAT 1 current sources and band-gap voltage 
reference (BVR) circuits. 

In this chapter, we will describe how BJTs are realized in CMOS technologies and 
discuss important parameters of this device. 

9.2 DEVICE STRUCTURE 

In an «-well CMOS technology, a vertical BJT structure is formed by placing a region 
of p + diffusion within an n-type well as shown in Figure 9.1. As indicated by the BJT 
symbol included in the figure, the device formed is a PNP transistor. 

Since the width of the base is determined by the depth of the well, which is relatively 
large, the current gain p of the BJT is quite low, usually between 5 and 10. Also, notice 
from the figure that the substrate itself realizes the collector terminal. Hence, the transistor 
is not very suited for use in amplifiers. 

We should also mention that some foundries refer to this device as an n-well junction 
diode in which the emitter corresponds to the anode of the diode and the well contact is 
the cathode. 

9.3 MODELING THE PARASITIC BJT 

As mentioned in the introduction, for the most important applications of the parasitic BJT, 
the only interesting characteristic is the emitter current versus the base-emitter voltage. 

1 PTAT: Proportional to absolute temperature. 
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Base Emitter 




Figure 9.1 A BJT structure realized in an «-well CMOS technology 

Therefore, in this chapter we have chosen to discuss only the modeling of the base-emitter 
pn-junction and we assume that the base terminal is grounded. We treat the device as a 
diode where the anode is represented by the emitter terminal, as shown in Figure 9.1, and 
the cathode is connected to ground. 

The symbol and the definitions of the voltage across the device and the current through 
the device are shown in Figure 9.2. 

Before we present the model for this device, we would like to discuss its characteristics. 
In Figure 9.3 we have plotted the emitter current 7 e versus the emitter voltage V e using 
a logarithmic scale on the vertical axis. The plot is based on simulation results of a 
parasitic BJT from a commercially available 0.5 -pm CMOS technology. We notice from 
the figure that for current levels from about 1 pA to 0. 1 mA the device exhibits the wanted 
exponential relationship between the applied voltage and the current through the device 
since we have a straight line in a semilogarithmic plot. In this region, the characteristics 
resemble that of an ideal diode and we can apply the well-known diode equation. However, 
at current levels below 1 pA and above 0. 1 mA, the characteristics deviate considerably 
from a straight line. Next we will discuss how to model these characteristics to take into 
account the nonideal effects. 

When designing circuits that are based on the assumption of exponential characteristics, 
the designer should generate a plot similar to the one shown in Figure 9.3 and identify 
the range in which the assumption holds. Then the designer should bias the device within 
this range. 



Emitter 

9 






V 



Figure 9.2 Symbol and definition of voltages and currents for the parasitic pnp BJT device 
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We start the description of the model by presenting the ideal diode equation and 
then show how to extend it to account for the nonideal effects. We will not include the 
background physics for the derivation of the ideal diode equation. Interested readers are 
referred to other texts such as Fjeldly el al. (1998). 



9.3.1 The Ideal Diode Equation 

The ideal diode equation can be written as 

I e = I s [e V ° /inV±) ~ 1], (9.1) 



where / s is the junction saturation current and n is the ideality factor. Furthermore, 
Vth is the thermal voltage given by k^T/q, where k\> y is the Boltzmann constant 
(= 1.38066 x 10~ 23 J/K), T is the absolute temperature in kelvin, and q is the electronic 
charge (= 1.60218 x 1 0 19 C). The junction saturation current I s can be expressed as 



h = Aq 



DpUpo k) ; : p, :<! \ 

L P L„ ) 



(9.2) 



where A is the area of the device, D p is the diffusion constant for holes, n po is the 
minority carrier concentration of electrons on the p- side, L p is the hole diffusion length, 
D n is the electron diffusion constant, p no is the minority carrier concentration of holes 
on the n-sidc, and L n is the electron diffusion length. The minority carrier concentrations 
are given as the ratio of the square of the intrinsic carrier concentration h, and the doping 
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Table 9.1 Silicon diode parameters at room temperature 



Parameter 


Description 


Value 


D„ 


Electron diffusion constant 


37.5 cm 2 /s 


D p 


Hole diffusion constant 


13cm 2 /s 


L n 


Electron diffusion length 


~0.08 cm" 


L P 


Hole diffusion length 


~0.03 cm" 


Hi 


Intrinsic carrier concentration 


1.02 x 10 10 /cnr 3 



"Valid only for low doping concentrations 



concentrations as follows: 

n? 

np ° = — , (9.3) 

n 2 

Pno = ~ 7-. (9.4) 

Ad 

Here, N a and Ad are the concentration of dopants on the p- and h - sides, respectively. In 
Table 9.1, approximate parameter values are given. The ideality factor n varies from 1.0 
to about 1.5. 

9.3,2 Nonideal Effects 

The ideal diode equation described by (9.1) to (9.4) is in reasonable agreement with exper- 
imental data for only a limited range of currents. Outside this range, many nonideal effects 
come into play such as carrier generation/recombination in the depletion region, series 
resistance, and high-level injection. Here follows a brief discussion of some important 
nonideal mechanisms that are relevant for the applications we are interested in. 



9.3.2. 1 Series resistance 

In real devices, the parasitic series resistance R s of the device contacts and of the semi- 
conductor quasi-neutral regions may play an important role, especially at high forward 
bias. Then the diode equation for the total current has to be modified according to 

7 e = / S [e (v '- / ' s » ) /(" v > t) - 1] = / s [e Vei/< "' /,h) - 1], (9.5) 

where V e i is the intrinsic voltage. An approximate analytical solution of I e versus V e for 
this case was discussed by Fjeldly et al. (1991). We note that at sufficiently large forward 
bias, most of the applied voltage will fall across the series resistance and the I e — V e 
characteristic approaches a linear form with a slope determined by R s . 



9.3. 2. 2 High injection 

At large forward bias, when the voltage across the intrinsic diode approaches the built- 
in potential, the junction barrier is severely reduced and the assumption of a low-level 
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injection (where the densities of minority carriers are small compared to those of the 
majority carriers) is no longer valid. Instead, the injected minority carrier concentrations 
may become comparable to those of the majority carriers, and we enter the so-called 
high-injection regime. It can be shown that in the high-injection regime the denominator 
of the argument to the exponential function in (9.1) is changed to In V* . The result is 
that the rate of increase of the current with respect to the voltage is reduced. 

The effect of the series resistance and high-injection effects are visible in the I -V char- 
acteristics in Figure 9.3 in the high-current region when the current level is approaching 
1 mA. In this region we notice that the current levels off and becomes less than what it 
would have been if we had continued on the trace of the ideal diode equation. The initial 
leveling off is caused by high-injection effects and eventually the series resistance comes 
into play resulting in a linear increase in the current with respect to the applied voltage. 

9.3.23 Generation/recombination in the depletion region 

Generation and recombination processes are taking place in the depletion region (neglected 
in the ideal diode) seeking to restore equilibrium. These processes give rise to current 
contributions that may play an important or even dominant role. These contributions 
depend on the concentration, distribution, and energy levels of traps in the depletion 
region. Traps associated with various impurities and defects are always present in any 
semiconductor material. The simplest model describing the generation and recombination 
currents is based on the assumption that we have only one type of dominant trap, uniformly 
distributed in the device. In reality, traps may be nonuniformly distributed, and more than 
one type of trap may be involved. By applying these assumptions, it can be shown that 
the generation/recombination current can be modeled as 

/gr = hr (l - [e Vei/( " rVai) - 1] (9.6) 

where 7 sr is a saturation current parameter related to the generation/recombination process 
and n T is the ideality factor for the generation/recombination current (~2). Note that the 
first bracket on the right-hand side of (9.6) gives the scaling of the depletion layer width 
with the potential V ei for an arbitrary grading of the junction doping profile, specified 
by the grading parameter m (1/2 for an abrupt junction and 1/3 for a linearly graded 
junction). Vj is the contact (built-in) potential of the junction (~0.7 V for silicon at room 
temperature). The generation/recombination current 7 gr is added to the ideal diode equation 
in (9.1). 

The effect of generation/recombination currents is visible in the 7-V characteristics 
in Figure 9.3 in the low-current region. Notice that the deviation from a straight line in 
this region causes the total current to become higher than the ideal current. This effect is 
caused by generation/recombination currents. 
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10 

Modeling of Passive Devices 



10.1 INTRODUCTION 

Passive devices such as resistors, capacitors, and inductors find wide usage in analog 
and RF CMOS integrated circuits. In this chapter, we will discuss the different structures 
utilized to implement the different passive devices and describe how to model them. 



10.2 RESISTORS 

Analog circuits usually include resistors in a variety of applications to implement functions 
such as current limiting and voltage division. Fortunately, most processes offer a choice 
of several different resistor structures utilizing different materials. Some structures are 
available in standard digital processes, while others require specific analog options that 
add extra cost to the fabrication of the circuit. In this section we will describe different 
resistor structures and discuss how they are modeled. However, first we will briefly discuss 
resistivity, sheet resistance, contact resistance, and resistor layout. 

The resistance of a resistor can be calculated from its dimensions and resistivity. In 
Figure 10. 1, we show a simple resistor structure constructed from a slab of a homogeneous 
material and perfectly conductive contacts. 

Using the symbols defined in the figure, the resistance of the structure shown can be 
written as 




where p is the resistivity of the material. For integrated CMOS circuits the resistors are 
either diffusions or depositions that can be treated as films of constant thickness. Therefore, 
it is common to combine resistivity and thickness into a single term, called the sheet 
resistance R s h = p/t. Using this definition of the sheet resistance, (10.1) can be written as 

R = R S h^- ( 10 . 2 ) 

In (10.2), the ratio L/W is often called the number of resistance squares. Note that (10.1) 
and (10.2) are valid only for low frequencies in which the skin effect is not present. As 



Device Modeling for Analog and RF CMOS Circuit Design. T. Ytterdal, Y. Cheng and T. A. Fjeldly 
© 2003 John Wiley & Sons, Ltd ISBN: 0-471-49869-6 





250 MODELING OF PASSIVE DEVICES 




Figure 10.1 A simple resistor structure consisting of a slab of a homogeneous material contacted 
by perfectly conductive terminals 



the frequency is increased, the 
section of the resistor. Instead, 
To account for this effect, we 
as follows: 



current is not distributed uniformly throughout the cross 
the current is concentrated at the surface of the material, 
modify the expression for R s h to include the skin effect 



^sh = 



P 

<5(1 -e-'/y 



(10.3) 



where 




(10.4) 



Here, <5, coq, and /m represent the skin depth, the radial frequency, and the permeability 
in vacuum, respectively. 

In the above discussion, we have assumed that the contacts at both ends of the resistor 
have zero resistivity. Since real contacts have a finite resistance, we have to include terms 
in (10.2) that account for the resistance of the contacts. A typical layout of a resistor 
consisting of a body and two contact heads is shown in Figure 10.2. Here Lb and W b are 
the length and the width of the body, respectively, and W c and W 0 are the width of the 
contact and the overlap of the head, respectively. 

Two different mechanisms give rise to resistance in the contacts. The first mechanism 
is caused by a potential barrier that exists between the metallization and the material 
of the resistor. The second mechanism is due to the current spreading or crowding as 
it enters the contact. If W c + 2W 0 is greater than Wb, the current spreads out and the 
resulting resistance is slightly decreased. The total resistance of the resistor structure in 
Figure 10.2 is given by 

R = R b + 2R h , (10.5) 







Figure 10.2 The layout of a typical resistor consisting of the resistor body and two resistor heads 
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where R b and R b are the resistances of the body and the head, respectively, and is given by 

R b = Rsh^ ( 10 . 6 ) 

W b 

and 

R h = R c + R sh ^ + A n ) . (10.7) 

Here, R c is the contact resistance and An is an adjustment of the number of resistance 
squares caused by current spreading or crowding. If W c + 2 W 0 > Wb, An is negative, 
otherwise it is positive. The following approximate formula for An was proposed by 
Ting and Chen (1971): 



where 



An = — 

IT 



-In 

k 




+ In 



k 2 - 1 

— ¥~ 



k = 



w b 

Wb - w c ■ 



( 10 . 8 ) 



(10.9) 



This formula is strictly valid only for W c Wb — W c . 

The contact resistance R c in (10.7) results from the presence of a potential barrier 
between the resistance material and the metal. Murrmann and Widmann (1969) showed 
that the contact resistance could be written as 



Rc 



\/ R$hPc 

Wc 



coth 




( 10 . 10 ) 



where p c is the specific contact resistance and L c is the length of the contact (L c = W c 
if square contacts are used). In Table 10.1, we list contact resistances for two common 
contact systems. 

This finalizes our general discussion of resistors. In the following sections, we will 
discuss the most important resistors types available in CMOS processes in terms of sheet 
resistance and equivalent circuits including parasitics. 



10.2.1 Well Resistors 

In digital CMOS processes, where there is no high sheet resistance poly available, we 
may use one or more wells to implement resistors. In conventional CMOS processes. 



Table 10.1 Typical contact resistances for two common contact 
systems (Hastings 2001) 



Contact system 


Sheet resistance R sb (O/sq) 


R c (0|xm 2 ) 


Al-Cu-Si 


160 


750 


Al-Cu/Ti-W/PtSi 


160 


1250 
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Figure 10.3 Equivalent circuit of a well resistor 

there is usually only one type of well available, either n-well or p-well. However, in 
state-of-the-art scaled down processes (as of this writing, 0.18-p.m CMOS and newer), it 
becomes more and more common to have several well types. 

The most important feature of well resistors is the relative high sheet resistance, on 
the order of 1 to 10 kilo-ohms per square. The disadvantages of well resistors are high 
temperature coefficients (TCs) (may be as high as 6000ppm/°C), voltage dependency, 
and large parasitic capacitances to ground since the well is located close to the substrate. 
A li equivalent circuit of a well resistor valid up to moderate frequencies (~ 100 MHz) 
is shown in Figure 10.3. Note that the parasitic capacitors are implicitly included through 
the reverse-biased diodes. By using diodes in the equivalent circuit, leakage currents to 
ground are also accounted for in the equivalent circuit, which is important at elevated 
temperatures. If the resistor is long, a hr equivalent circuit may not be accurate enough. 
In such cases the equivalent circuit in Figure 10.3 can be easily extended to 3n and 5 it 
equivalents. 

As a final note on well resistors, we would like to point out an important issue. When 
laying out well resistors, remember that the width should be at least twice as large as 
the depth of the well. Otherwise the resistor does not achieve full junction depth and the 
sheet resistance becomes much higher than the values reported by the foundry. 



10.2.2 Metal Resistors 

When small resistances are desired, maybe the best choice is to use one of the metal layers. 
Sheet resistance of the different metal layers is typically in the range 20 to 40 mO/sq. 
The advantages of metal resistors are low parasitic coupling to the substrate, low-voltage 
dependency, and low TCs. 



10.2.3 Diffused Resistors 

Diffused resistors can be realized in a CMOS process by making contacts to each side 
of an implanted region (the same type of region that is used for the drain and source of 
MOS transistors). Resistors made this way exhibit sheet resistances in the range 20 to 
50 Q/sq if silicide blocks are used. Hence, using this resistor type may add extra cost to 
the manufacturing (check the documentation of the process you are using). This resistor 
type is not often used, since most CMOS processes offer poly resistors that have equal 
or greater sheet resistances (see next section). 
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10.2.4 Poly Resistors 

Polysilicon is used in all modern CMOS processes for producing the gate of the MOS 
transistors. After deposition, the poly is heavily doped to improve conductivity for obtain- 
ing high-speed operation of the MOS transistors. The sheet resistance of heavily doped 
poly lies in the range of 1 to 20 £2/sq. At en extra cost, a mask can be manufactured that 
stops heavy poly doping at regions where resistors are desired. As a result, lightly doped 
poly can be produced with sheet resistance varying between 20 to 1000 O/sq. The TC of 
poly resistors can have both positive and negative values depending on the doping density 
and the type of doping atoms used. In general, the absolute value of the TC increases 
with the sheet resistance. 

Similar to well resistors, the equivalent circuit of poly resistors also includes parasitic 
capacitances to ground. However, since the poly layer is located further from the substrate 
compared to wells, the capacitance per unit resistor area is smaller for poly resistors 
compared to well resistors. Poly resistors may have unit-area parasitic capacitances on 
the order of 0.1 fF/|xm 2 . A l7r equivalent circuit of a poly resistor valid up to moderate 
frequencies is shown in Figure 10.4. 

In RF CMOS circuits, only poly and metal resistors are used since the other resistor 
types are located too close to the substrate, which causes the substrate loss to be larger 
than necessary. At gigahertz (GHz) frequencies, the equivalent circuits discussed so far 
are not accurate enough. Effects that need to be included are the substrate loss and the 
parasitic series inductance. An equivalent circuit valid at frequencies up to about 10 GHz 
is shown in Figure 10.5. 

The only intended element in Figure 10.5 is the ideal resistor R. All the other elements 
are considered parasitics. The self-inductance of the metal or the poly resistor is modeled 
by inductor L p . Parasitics due to the substrate are commonly modeled in terms of C p , C s , 
and R s . The lateral dimensions of the structure are on the order of hundreds of microm- 
eters, which is much larger than the oxide thickness and comparable to the substrate 
thickness. Thus, the substrate capacitance and resistance are approximately proportional 
to the area occupied by the resistor and can be estimated by 



1 £()£ox 

C p - -L b lF b ^, 

^ *ox 


(10.11) 


^sub = ^sub0^b/Wb> 


(10.12) 


1 

Cs ^L\yW\)C sub > 


(10.13) 


2 

Rs ~ L b W b G sub ■ 


(10.14) 
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Figure 10.4 Equivalent circuit of a poly resistor 
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Figure 10.5 Equivalent circuit of a poly resistor that is valid at gigahertz frequencies 

Here e 0 (= 8.85418 x 10 -12 F/m) is the permittivity in vacuum, and e ox and t ox are 
the dielectric constant and the thickness of the field oxide, respectively. Furthermore, 
R SU bo, C SU b, and G su b are technology parameters that must be provided by the foundry. 
Equations (10.11)— (10.14) are valid only if the body of the resistor is much larger than 
the heads. 



10.3 CAPACITORS 



Capacitors have become ubiquitous in analog-integrated circuits particularly owing to the 
switched capacitor technique for realization of analog-to-digital and digital-to-analog data 
converters and discrete-time filters. Other applications include continuous-time filters, RF 
building blocs, and for compensation in feedback amplifiers. 

Farad is the SI unit for capacitance. In the context of integrated circuits, the range of 
values is between several femtofarads (fF) to about 100 picofarads (pF). Capacitors used 
in integrated circuits are parallel-plate capacitors in which a slab of an insulating material 
called the dielectric is sandwiched between two conductive plates. The capacitance value 
of a parallel-plate capacitor is probably known to the reader, but is repeated here for your 
convenience: 



A£ 0 £j 

h 



(10.15) 



where A is the area of the plates, i: l and t, are the relative permittivity (or dielectric 
constant) and thickness of the insulator, respectively. The dielectric constants of the two 
most common insulators used in CMOS are shown in Table 10.2. 

The expression in (10.15) slightly underestimates the capacitance since not all field 
lines go through the insulator. Some of the field lines go through the air and is called the 
fringing field, which increases the apparent area of the plates. This effect is proportional 
to the thickness of the insulator. In integrated circuits it is usually neglected since the 
vertical dimensions are usually much smaller than the dimension of the plates. 
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Table 10.2 Relative permittivities of insulators 
used in CMOS technologies 



Material 


Relative permittivity 


Silicon dioxide (Si0 2 ) 


3.9 


Silicon nitride (S^N^ 


7.5 



The available capacitor structures in CMOS technologies are poly-insulator-poly (or 
simply poly-poly) capacitors, metal-insulator-metal (MIM) capacitors, MOSFET capaci- 
tors, and junction capacitors. These different capacitor types are discussed in the follow- 
ing sections. 



10.3.1 Poly -poly Capacitors 

As the name indicates, both plates of a poly-insulator-poly (or simply poly-poly) capacitor 
are made of deposited polysilicon that is doped to keep the resistivity low. The bottom 
plate is usually implemented using the same layer as the poly gate of MOSFETs. The other 
plate must be supported by a second poly layer. There are extra processing steps involved 
in poly-poly capacitors since the insulator is unique to this structure. Hence, if you want 
poly-poly capacitors you would have to pay extra for the fabrication of your circuits. An 
example vertical cross section of a poly-poly structure is shown in Figure 10.6. 

We note from the figure that the bottom plate is placed on top of field oxide to 
shield the plate from the substrate. Of course, the shielding is not perfect and there is a 
parasitic capacitance connected between the bottom plate and the substrate. This parasitic 
capacitance, which is labeled C p i in Figure 10.6, can be up to 20% of the intended 
capacitance (labeled C). The parasitic capacitance labeled C P 2 is much smaller than C p i 
since the distance between the metal layer and the substrate is large. 

An equivalent circuit valid up to moderate frequencies can be extracted from the vertical 
cross section in Figure 10.6 and is presented in Figure 10.7. At gigahertz frequencies the 
lossy substrate has to be accounted for in the equivalent circuit using an approach similar 
to that was used for resistors in Section 10.2.4. 




Substrate (AC ground) 



Figure 10.6 Vertical cross section of a poly-poly capacitor structure 
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Figure 10.7 Equivalent circuit of poly-poly capacitors valid up to moderate frequencies 

The capacitance per unit area of poly-poly capacitors changes from one technology 
generation to the next because both vertical and lateral dimensions are scaled. Hence, it 
is not possible to provide a number that can be universally applied. For example, at the 
0.5-p.m technology node, the capacitance per unit area was around 1 fF/|im 2 . You would 
have to refer to the documentation of your current process for accurate numbers. 

There is a voltage modulation of poly-poly capacitors caused by poly-depletion since 
the conductivity of the polysilicon plates is finite. Usually, the voltage dependency is 
modeled using a second-order polynomial description as follows: 

C = C 0 (l +kiV + k 2 V 2 ), (10.16) 

where Co is the capacitance at zero applied voltage, k\ and k 2 are the first- and second- 
order coefficients of the voltage dependency, and V is the voltage across the capacitor. 
The coefficients k\ and k 2 must be supplied by the foundry or extracted from measured 
data. If the linearity of poly-poly capacitors is not sufficient, one can instead use metal 
plates. This type of capacitors is discussed in the following section. 



10.3.2 Metal -insulator -metal Capacitors 

Usually, when referring to metal-insulator-metal (MIM) capacitors, we assume that struc- 
tures involving thin insulators are provided by dedicated process steps. This is a relatively 
novel structure that made its introduction in recent technology generations. However, the 
principle of using metal layers as the plate material has been utilized by design houses 
for a long time to cut cost since the double poly option could be skipped. The draw- 
back of such capacitors is the low capacitance per unit area (typically 0.05 fF/p,m 2 ) since 
the oxide used between metal layers is quite thick. Such capacitors will not be further 
discussed here. Instead we will focus on MIM capacitors since they are rapidly becom- 
ing very popular owing to their high linearity and high unit-area capacitance. In modern 
state-of-the-art processes, MIM capacitors are replacing poly-poly capacitors owing to 
their improved linearity and mismatch characteristics since the conductivity of the metal 
plates is higher than that for the corresponding polysilicon plate, which reduces the effect 
of depletion. 

Usually, the MIM capacitor is realized using one of the conventional metal layers as 
the bottom plate and a dedicated thin metal layer placed between two conventional metal 
layers to realize the top plate. The insulator is made of a thin dielectric with a thickness 
of about 40 nm. 




CAPACITORS 257 




/ 

Oxide 



Figure 10.8 Top view of a lateral flux capacitor utilizing a single metal layer 

The area consumed by capacitors tends to be large. Therefore, the search for more 
area-effective structures is on and considerable amount of research has been conducted 
to exploit new structures both in the vertical and in the lateral dimension. In the vertical 
dimension, the area efficiency can be improved by using more than two metal layers. With 
the continuous downscaling of the minimum line width in CMOS technologies, the lateral 
field has been exploited to realize capacitor structures such as the lateral flux capacitance 
and quasi-fractal capacitors with enhanced capacitance per unit area (see, for example, 
Akcasu (1993); Samavati el al. (1998); Stolmeijer and Greenlaw (1999)). The top view of 
an example lateral flux capacitor structure realized using a single metal layer is shown in 
Figure 10.8. As a final remark on MIM capacitors, we should mention that the equivalent 
circuits that are used for poly-poly capacitors can also be used for MIM capacitors. 

10.3.3 MOSFET Capacitors 

MOS transistors can be used as capacitors. Their selling point is the high unit-area capac- 
itance due to the thin oxide. For example, in 0.13-|im CMOS processes the unit-area 
capacitance is about 10fF/|xm 2 . If the transistors are placed in isolated wells, floating 
capacitors with both negative and positive applied voltages can be realized. The connec- 
tion of the transistor in such a configuration is shown in Figure 10.9. One of the most 
important applications of such capacitors is frequency compensation in feedback systems 
since the accuracy of the capacitance value is not important; it just has to be larger than a 
given value. There is only one issue to be aware of when using such a capacitor as a fre- 
quency compensation element, and that is the dip in the capacitance around the threshold 
voltage of the transistor as shown in Figure 10.10. 



o- 



-o 



+ 



Figure 10.9 MOS transistor in an isolated well configured as a floating capacitor 
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V C (V) 

Figure 10.10 Capacitance-voltage characteristics of a MOS transistor configured as in Figure 10.9 



Another important application of the capacitance element shown in Figure 10.9 is as 
a variable capacitor (varactor) in RF circuits as a frequency-tuning device. In this type 
of application, we want to be in the dip region in Figure 10.10 where the capacitance is 
most sensitive to the voltage across it. In varactor structures the gate structure and the 
contacts can be placed over both n- and p-type wells to create both conventional MOS 
transistors and accumulation mode transistors. 



10.3.4 Junction Capacitors 

Another type of capacitor available to designers of CMOS circuits is the junction capacitor 
that utilizes the depletion region surrounding a reverse-biased pn-junction as the insulator. 
An example structure is shown in Figure 10.11 where the capacitor is formed by placing 
an n + diffusion region inside a p-type well. The permittivity of silicon is about three 
times higher than that of silicon dioxide, so the capacitance per unit area is quite high. 
For example, in 0.13-p.m CMOS processes it is on the order of 2fF/p,m 2 . 

We note from Figure 10.11 that the effective area to be used when estimating the 
capacitance value of the structure consists of two parts, the bottom and the sidewalls of 
the n + diffusion region. The area of the bottom region is usually approximated by the 
length multiplied by the width of the n + diffusion region. An approximate, and most 
often used, estimate of the area of the sidewalls is the depth of the junction, which is 
a technology parameter, multiplied by the perimeter of the n + diffusion region times a 
constant to account for the nonrectangular shape. Usually, the foundries provide numbers 
for two unit capacitors and the total capacitance at zero bias can be calculated as follows: 



CtotalO — Cjo A + Cj sw () P , 



(10.17) 
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Figure 10.11 A junction capacitance formed by an n + diffusion region inside a p-type well 

where A and P are the area and the perimeter of the n + diffusion region, respectively, 
and Cjo and Cj SW o are the capacitance at zero bias per unit area and perimeter of the of 
the n + diffusion region, respectively. 

Since we have denoted the above parameters at zero bias, you have probably guessed 
that the junction capacitance is voltage-dependent. To illustrate this, we have plotted in 
Figure 10.12 the junction capacitance versus applied reverse voltage for a commercially 
available 0.13-p.m CMOS technology. We note from the figure that the capacitance is 
maximum at zero voltage and has decreased by about 40% at a reverse bias of 2 V. The 
capacitance decreases with the square root of the applied voltage because the width of 
the depletion region increases at this rate. 




Figure 10.12 Junction capacitance versus applied reverse voltage for a commercially available 
0.13-p.m CMOS technology 
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Table 10.3 Diode junction parameters in SPICE 



Parameter 


Description 


Default value 


CJ0 


Zero-bias junction capacitance 


OF 


CJSW 


Zero-bias sidewall capacitance 


OF 


MJ 


Grading coefficient 


0.5 


MJSW 


Grading coefficient of the sidewalls 


0.33 



The voltage dependency illustrated in Figure 10.12 is the most serious drawback of 
the junction capacitance. Another problem, that is particularly troublesome at elevated 
temperatures, is the conductance that exists in parallel with the capacitor that causes the 
capacitor to leak. This effect is dominated by generation/recombination currents in the 
depletion region. 

Usually, a junction diode is used to model junction capacitors, where the key model 
parameters in SPICE are listed in Table 10.3. 



10.4 INDUCTORS 

Interest in integrated spiral inductors in CMOS processes has surged with the recent 
growing demand for CMOS RF communication circuits. This is because inductors form 
the core to a successful integrated radio transmitter and receiver (transceiver). While the 
integration of resistors and capacitors in CMOS is a well understood process, the inclusion 
of inductors is still a challenging endeavor. In recent years a great amount of work has 
been published on the analysis and modeling of on-chip spiral inductors (see, for example, 
Lutz et al. (1999); Yue and Wong (2000); Kythakyapuzha and Kuhn (2001)). 

The spiral inductor is the most common structure for implementing on-chip inductors 
in standard CMOS processes. A spiral structure is laid out using two or more of the top 
metal layers. The top metal layers are used since this will minimize the resistive coupling 
to the substrate to reduce the loss as much as possible. In Figure 10.13, we show a typical 
layout of an inductor using two metal layers. More than two metal layers can be utilized 
to increase the Q factor (Lutz et al. 1999). 

The inductance value of spiral inductors is a complicated function of geometry and is 
usually estimated from two-dimensional electromagnetic (EM) simulations. For a crude 
estimate (within ±30%), the following equation can be used (Lee 1998): 

L = n 0 n 2 r « 1.2 ■ 10~ 6 n 2 r, (10.18) 

where n and r represent the number of turns and the radius of the spiral, respectively. 

The key to accurate modeling of the spiral inductor at high frequencies is the ability to 
identify the important parasitics. Fortunately, in most cases the foundries do this job for us 
providing an equivalent circuit and the values for each circuit element. The values for the 
circuit elements are usually extracted from predefined structures. Hence, if the designers 
wish to modify the structures to optimize the inductor for either a special frequency range 
or an application, the characterization task has to be performed again. 

A commonly used equivalent circuit of an integrated spiral inductor is shown in 
Figure 10.14 (see, for example, Yue and Wong (2000)). 
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Figure 10.13 Typical layout of a spiral inductor using two metal layers 



L R s 




Figure 10.14 Equivalent circuit of an integrated spiral inductor on a silicon substrate 

Note that since inductors are inserted in a circuit to store magnetic energy, both 
capacitors and resistors are considered parasitics, resistors because they introduce loss 
(decreasing the Q value) and capacitors because they store electric energy and provide 
coupling to the lossy substrate. Thus, the only intended element in Figure 10.14 is the 
ideal inductor L. 

The resistance of the metal spiral is modeled by the resistor R s in series with the 
inductor. The resistance value can be calculated using (10.2) and (10.3). 

The capacitor denoted as C s in Figure 10.14 models the capacitive coupling between 
the input and the output ports of the spiral inductor. By inspecting the physical layout 
of the inductor in Figure 10.13, we notice that both the overlap between the spiral and 
the underpass and cross-talk between adjacent turns contributes to C s . In most practical 
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inductors, adjacent turns are on almost the same potential and the total capacitance is 
dominated by the overlap capacitances. Hence, it is sufficient to model C s as a sum of 
overlap capacitances as follows: 



C S =«W S 2 



£q£ox 
to x_m 



(10.19) 



where IV, is the spiral line width, e ox is the dielectric constant of the oxide, and i ox _, n is 
the vertical distance between the spiral and the underpass. 

Parasitics due to the substrate are commonly modeled in terms of C ox , Csi, and R$i 
(see, for example, Hughes and White (1975)). For spiral inductors on silicon, the lateral 
dimensions of the spiral structure are on the order of hundreds of micrometers, which is 
much larger than the oxide thickness and comparable to the substrate thickness. Thus, the 
substrate capacitance and resistance are approximately proportional to the area occupied 
by the spiral and can be estimated by 



Cox = 


1 £()£oX 

L,W S 0 °\ 

2 t 
‘ox 


(10.20) 


c si = 


1 

— z. s w s c sub , 


(10.21) 


R S i = 


2 


(10.22) 


U WgGsub 



Here, C su b and G su b are technology parameters usually provided by the foundry. 
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11 

Effects and Modeling of Process 
Variation and Device Mismatch 



11.1 INTRODUCTION 

As CMOS IC fabrication technology becomes more and more advanced, the control of 
process variation and manufacturing uncertainty becomes more and more critical. The 
circuit yield loss caused by the process and device parameter variation has been more 
pronounced than before. Because CMOS technology demonstrates a promising future for 
low-power and low-cost analog and mixed-signal applications, designers are currently 
utilizing advanced deep-submicronreter technologies to achieve the goal of low-voltage 
and high-performance integration. However, owing to difficulty in controlling the variation 
in different fabrication steps such as gate oxide growth, channel and source/drain implants, 
photolithography, etching, and so on in modern technologies, the variation in device 
parameters will become larger compared with older technology. Designers need physical, 
predictive, and accurate statistical models to describe the device (and hence the circuit) 
parameter variation caused by process variations. 

Typically, the variations of device/circuit characteristics are divided into two categories, 
interdie variation and intradie variation/mismatch. The interdie variation describes the die- 
to-die, wafer-to-wafer, or lot-to-lot process variability. In other words, the same variation 
is assumed for the devices in the same circuit, so the interdie device variation has little 
influence on the circuit behavior of some analog circuits such as a current mirror with a 
constant current bias as long as all transistors can still be biased in the saturation region. 
The intradie device variation and mismatch describes the die-/wafer-level process variabil- 
ity caused by the variation of some process parameters such as oxide thickness and doping 
profile across the die/wafer. This is sometimes called local process variation and mismatch 
(LPVM) (Pronath et al. (2000); Zanella et al. (1999)). In this case, devices in the same 
circuit may have different variations in electrical parameters caused by LPVM. For digital 
circuits, the influence of interdie variations on the circuit performance is important. So 
most circuit simulators with statistical modeling capability for digital applications ignore 
the intradie variations when simulating the circuit behavior with considerations of process 
and device variation, which does not cause problems because the device variation caused 
by the interdie variability is indeed much larger than that caused by intradie variability. 
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The mismatching of devices in the same circuits has little influence on the circuit behavior. 
However, for analog circuits, not only interdie process variation but also intradie process 
variation and device mismatch influence significantly the variations of the circuit behav- 
iors (Pelgrom et al. (1998); Groon et al. (2001); Tarim et al. (2000)). Statistical modeling 
of devices with intradie process variation and mismatch is needed in designing analog 
circuits to statistically and accurately simulate/predict the analog circuit performance. 

Together with the MOS transistors, other devices such as resistors and capacitors used 
in analog circuit design are also sensitive to the LPVM. It is important to model the 
LPVM, which influences the electrical behavior of resistors, capacitors, and MOSFETs 
for analog/RF applications. 



11.2 THE INFLUENCE OF PROCESS VARIATION 
AND DEVICE MISMATCH 

11.2.1 The Influence of LPVM on Resistors 

Typical CMOS and BICMOS technologies offer several different resistors, such as dif- 
fusion n + /p + resistors, n + /p + poly resistors, and n-well resistor. Depending on the 
applications, designers select different resistors in circuit design. Since different materials 
have various electrical characteristics and temperature variations, the effects of process 
variation will be different for different types of resistors. Many factors in the fabrication 
of a resistor such as the fluctuations of the film thickness, doping concentration, doping 
profile, and the annealing conditions as well as the dimension variation caused by the 
photolithographic inaccuracies and nonuniform etch rates can vary the value of a resis- 
tor. Poly resistors, which consist of a strip of poly deposited on top of field oxide in 
LOCOS isolation technology or in shallow trench isolation technology, are widely used 
in analog circuit design. We take it as an example to discuss the influence of the process 
variation. 

The process control in the deposited polysilicon film thickness is one of the key fac- 
tors to vary the poly resistor value. Typically a modern CMOS process allows 10 to 15% 
variation in polysilicon thickness, which results in a similar amount of change in resis- 
tance. Further, the doping level determined by various implants and annealing steps in 
the process will contribute a 10 to 20% variation. The overall effects from the variation 
in poly silicon thickness and doping will result in a sheet resistance variation of 15 to 
25%. In addition, the line-width control, a measure of dimensional variation caused by 
photolithography and processing, is also critical in determining the resistance variation. 
Most modern processes can have line- width control of poly dimension less than ±10% 
of their minimum feature size. However, one should be aware that an extremely narrow 
poly resistor might have increased variation in the resistance owing to the growth of 
individual grains across the entire width of the resistor. When considering poly resistor 
etching, which easily causes a large variation in resistance value depending on the geom- 
etry of the resistor, typically, the overall (wafer-to-wafer and lot-to-lot) sheet resistance 
variation of poly resistors in modern IC technologies is in the range of 20 to 30% (Lane 
and Wrixon (1989)). The variation on a single wafer (die-to-die) is smaller. Figure 11.1 
shows a 3-sigma variation of unsalicided poly resistances measured from a single wafer 
in a 0. 1 8-p m CMOS technology. 




THE INFLUENCE OF PROCESS VARIATION AND DEVICE MISMATCH 265 




Figure 11.1 Measured data for characterizing the variation of the poly resistor on a single wafer. 
Reproduced from Cheng Y. (2002) The influence and modeling of process variation and device 
mismatch for analog/RF circuit design. Proceedings of the Fourth IEEE International Caracas 
Conference on Devices, Circuits and Systems , pp. 282-289 

As mentioned above, a poly resistor should always reside on the top of the field oxide, 
which not only reduces the parasitic capacitance between the resistor and the substrate but 
also ensures that oxide steps do not cause unexpected resistance variations. In addition, 
it should be noted that the oxide isolating the resistor will provide the isolation both 
electrically and thermally. A poly resistor that dissipates sufficient power will experience 
permanent resistance variations due to self-induced annealing. Extreme power dissipa- 
tion will melt or crack polysilicon long before diffused resistors of similar dimensions 
suffer damage. 

As discussed above, integrated resistors display significant variations in the sheet resis- 
tance due to the influence of the variation of some process and device parameters in the 
fabrication. However, this does not prevent the resistors from being used in various ana- 
log circuit designs as long as the device matching properties are within the range the 
designs require. For analog circuit applications, the matching property of resistors is as 
important as, or more important than, the variation of the absolute value of the resistance. 
The understanding and modeling of the mismatching of resistors is very important for 
analog and RF applications. 

Besides the factors causing the variation in absolute resistance, such as the fluctua- 
tions in dimensions, doping, and film thickness, more factors related to the fabrication 
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process and material could influence the matching behavior of the resistors. Typically 
two parameters, mean (M) and standard deviation (er), are used to describe the matching 
characteristics of the devices defined in Eqs. (11.1) and (11.2) (Box et al. (1978)): 



M = 



N 






N ’ 



a = 



Z(8i-M) 2 



i= 1 



N 



N - 1 



( 11 . 1 ) 



( 11 . 2 ) 



where <5, is the parameter value of the ;th sample unit. The mean is a measure of the 
systematic mismatch between the matched devices, caused by mechanisms that influence 
all of the samples in the same way. The standard deviation describes random mismatch 
caused by statistical fluctuations in process parameters or material properties. Once the 
mean and standard deviation have been determined using Eqs. (11.1) and (11.2), they 
can be used to predict worst-case mismatches according to so called 3-sigma or 6- 
sigma mismatch. 

During the processing, many factors can result in fluctuations of the resistance of the 
resistor and all these fluctuations can be categorized into two groups: one in which the 
fluctuations occurring in the whole device are scaled with the device area, called area 
fluctuations, and the other in which fluctuations take place only along the edges of the 
device and are therefore scaled with the periphery, called peripheral fluctuations. For a 
matched resistor pair with width W and resistance R , the standard deviation of the random 
mismatch between the resistors is (Shyu et al. (1982)) 



a — 




WsfR 



(11.3) 



where / a and / p are constants describing the contributions of area and periphery fluctua- 
tions, respectively. 

Figure 11.2 shows the measured data of 3-sigma deviation versus resistance (geometry) 
from a group of unsalicided resistors. It shows that the resistors with narrow widths have 
poorer matching characteristics, which can be explained by Eq. (11.3). In circuit applica- 
tions, to achieve required matching, resistors with width (at least 2-3 times) wider than 
minimum width should be used. Also, as shown in Figure 1 1 .2, resistors with higher resis- 
tance (longer length) at fixed width exhibit larger mismatching. To achieve the desired 
matching, it has been a common practice that a resistor with long length (for high resis- 
tance) is broken into shorter resistors in series. 



11.2.2 The Influence of LPVM on Capacitors 

Current CMOS technology provides various capacitance options, such as poly-to-poly 
capacitors, metal-to-metal capacitors, MOS capacitors, and junction capacitors. The more 
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Figure 11.2 Measured 3-sigma mismatching of poly resistors. Reproduced from Cheng Y. (2002) 
The influence and modeling of process variation and device mismatch for analog/RF circuit design. 
Proceedings of the Fourth IEEE International Caracas Conference on Devices , Circuits and Systems, 
pp. 282-289 

popular ones used in the design are metal-to-metal capacitors, junction capacitors, and 
MOS capacitors. It has been known that the integrated capacitors show significant vari- 
ability due to the process variation. For example, both the capacitance value and matching 
property of a metal-insulator-metal(MIM) capacitor are sensitive to the variation in the 
thickness of dielectric and geometry of metal plates. For a MOSFET or a MOS capacitor, 
the capacitance values are strongly dependent on the changes in oxide thickness and dop- 
ing profile in the channel besides the variation in geometries. Typically a modem CMOS 
process can maintain a variation in some integrated capacitors such as MOS capacitors 
within ±20%. However, the variation in MIM capacitors can be larger than 20%, espe- 
cially the one called parasitic capacitor or lateral capacitor, distinguishing it from the 
typical (vertical) MIM capacitors, because this kind of capacitor is dependent not only 
on the thickness of the dielectric and the geometries but also on the permittivity of the 
dielectric him that varies with the composition and the growth condition of the dielectric. 
Similarly, a ±15-20% variation in junction capacitors should be expected in a modern 
process, and the variation becomes larger in capacitors with large area. 

Similar to the resistors, the matching behavior of capacitors depends on the random 
mismatch due to periphery and area fluctuations (without including the fluctuation of the 
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oxide thickness) with a standard deviation (Shyu et al. (1982)), 



er = 




s/C 



(11.4) 



where / a and / p are factors describing the influence of the area and periphery fluctuations, 
respectively. A more complex formula to account for the random oxide and mobility 
effects has also been reported by Shyu et al. (1984). 

As given in Eq. (11.4), the contribution of the periphery components decreases as the 
area (capacitance) increases. For very large capacitors, the area components dominate 
and the random mismatch becomes inversely proportional to sfc. Figure 11.3 shows 
the mismatching characteristics of the MIM capacitor. Even though the mismatching of 
such kinds of MIM capacitors is larger than that of other capacitors such as regular 
MIM capacitors, the trend of mismatching versus capacitance approximately follows that 
represented by Eq. (11.4). 




0.01 0.1 1 to too 

Capacitance (pF) 

Figure 11.3 Measured 3-sigma mismatch of a MIM capacitor. Reproduced from Cheng Y. (2002) 
The influence and modeling of process variation and device mismatch for analog/RF circuit design. 
Proceedings of the Fourth IEEE International Caracas Conference on Devices, Circuits and Systems, 
pp. 282-289 
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11.2.3 The Influence of LPVM on MOS Transistors 

MOSFETs are the most complex components in a CMOS technology fabrication. The 
variations in many process parameters can result in the variations in device characteris- 
tics. The more important ones are the variation in oxide thickness, doping concentration, 
and profiles in both the channel region and the source/drain region, and device chan- 
nel length/width. For 0.1-|xm or less advanced technology, the variations in doping and 
annealing condition for the polysilicon gate and in silicon-oxide interface characteristics 
should be taken into account because they will influence significantly some electrical 
parameters such as the traps/interface charges, effective oxide thickness, mobility, and 
so on. 

As MOSFETs attract more and more analog/RF applications, the matching charac- 
teristics are crucial in achieving precise analog/RF circuit design. Owing to the fast 
development of very large-scale integration (VLSI) technologies, the minimum feature 
sizes of MOSFETs in several advanced technology generations, such as 0.13 p,m and 
less, have been scaled to an unbelievable geometry range, which is reaching the physical 
limit for the devices to operate properly according to the classic device physics theory. In 
this case, many factors, which are negligible in influencing the device matching behavior 
of devices with large feature sizes (0.5 |xm and above), are now becoming important. 
Advanced device mismatching models based on the device characterization of modern 
technology are needed to include new factors that are more pronounced in devices in 
0.13 |xm and less technologies since analog IC applications, such as D/A converters and 
reference sources, would like to have approaches of reducing the layout area without 
degrading the device matching. 

Depending on the applications, different device factors are used to describe mismatch- 
ing behavior. The one used widely is A7 d /7 d , which typically includes two portions, 
threshold voltage mismatch and current factor mismatch as given in Eq. (11.5): 

A /. G m 1 

-AV th — ft A — (11.5) 

'd P 

where V th , ft, 7 d , and G m are the threshold voltage, current factor, drain current, and 
transconductance of the device, respectively. For analog applications, other factors for 
describing the mismatching behavior of small-signal parameters such as G m and G ds 
are also introduced (Thewes el al. (2000)). Furthermore, the influence of mismatching 
on RF behavior such as the distortion characteristics has been reported recently (Lee 
et al. (2000)). 

Figures 1 1.4 and 1 1.5 show the three-signal mismatching characteristics measured from 
a 0. 1 8-|un CMOS technology. The data in Figure 11.4 are measured by monitoring AV gs , 
representing the input offset voltage for a differential pair. A V gs is determined by applying 
the same gate and drain voltages to both transistors and then varying the gate voltage 
of one of the transistors until the drain current is equal to the other transistor. The data 
in Figure 11.5 are measured by monitoring the A7 d , representing the error between the 
currents in a simple current mirror. The A/ d in Figure 11.5 is defined as the percent 
difference in 7 d normalized by 7 d . 
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Figure 11.4 Measured data for 3-sigma mismatching of AV gs in PMOSFETs. Reproduced from 
Cheng Y. (2002) The influence and modeling of process variation and device mismatch for ana- 
log/RF circuit design. Proceedings of the Fourth IEEE International Caracas Conference on Devices, 
Circuits and Systems, pp. 282-289 
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Figure 11.5 Measured data for 3-sigma mismatching of A/ d in PMOSFETs. Reproduced from 
Cheng Y. (2002) The influence and modeling of process variation and device mismatch for ana- 
log/RF circuit design, Proceedings of the Fourth IEEE International Caracas Conference on Devices, 
Circuits and Systems, pp. 282-289 
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11.3 MODELING OF DEVICE MISMATCH 
FOR ANALOG/RF APPLICATIONS 

In the above section, we have reviewed the process variation and device mismatch and 
their influence on the device characteristics. Next we discuss the statistical modeling of 
the LPVM of these components. 



11.3.1 Modeling of Mismatching of Resistors 

The following equation is typically used to model a (polysilicon) resistor: 



R = R 



L sh 



■ + 



R e 



W + AW W + AW 



(11.6) 



where R %h is the sheet resistance of the poly resistor, R e is the end resistance coefficient, 
W and L are resistor width and length, and AW is the resistor width offset. 

The correlations between standard deviations (a) of the model parameters and the 
standard deviation of the resistance are given in the following: 
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To define the resistor matching, based on Eq. (1 1.7), we have 
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(11.12) 



Figure 11.6 shows the comparison result between the above model and the measured 
data of 3-sigma mismatching of unsalicided polysilicon resistors with different geometries. 
The maximum error percentage between the model and the data is less than 3%. 

11.3.2 Mismatching Model of Capacitors 

A simple MIM capacitor matching model, considering the most important sources of 
capacitance variation, is given in the following equations: 

OA c /C 2 = O 2 + or" + or d ", 



(11.13) 




272 EFFECTS AND MODELING OF PROCESS VARIATION 




Mean resistance (ohms) 

Figure 11.6 Comparison between the model and the measured data for 3-sigma mismatching 
of resistors. Reproduced from Cheng Y. (2002) The influence and modeling of process variation 
and device mismatch for analog/RF circuit design. Proceedings of the Fourth IEEE International 
Caracas Conference on Devices, Circuits and Systems , pp. 282-289 
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= fd ' d, 


(11.16) 



where f p , / a , and f A are constants describing the influence of periphery, area, and distance 
fluctuations. 

The periphery component (the first term in Eq. (11.13)) models the effect of edge 
roughness, and it is most significant for small capacitors, which have a relatively large 
amount of edge capacitance. The area component (the second term in Eq. (1 1.13)) models 
the effect of short-range dielectric thickness variations, and it is most significant for 
moderate size capacitors. The distance component models the effect of global dielectric 
thickness variations across the wafer, and it becomes significant for large capacitors or 
widely spaced capacitors. 

Figure 11.7 presents the comparison results between the model and the data for MIM 
capacitors in a 0.18-p.m CMOS process. In Figure 11.7, the distance, that is, the “space” 
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Figure 11.7 Comparison between the model and the measured data for 3-sigma mismatching of 
MIM capacitors. Reproduced from Cheng Y. (2002) The influence and modeling of process varia- 
tion and device mismatch for analog/RF circuit design, Proceedings of the Fourth IEEE International 
Caracas Conference on Devices, Circuits and Systems , pp. 282-289 



in the figure, is defined from the center of one capacitor to the center of the other capacitor. 
It shows that the simple model given by Eqs. (11.3) to (11.6) can predict the measured 
data well. 



11.3.3 Mismatching Models of MOSFETs 

11.3.3.1 Simple model 

The following simple first-order model has been used to the model A/ d results in satura- 
tion region: 
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(11-18) 

(11.19) 

(11.20) 



where / d = /3 ( V gs — V th ) 2 and /3 = [(/u 0 ■ C ox )/2] ■ [W/L], ji 0 is the mobility, C ox is the 
unit-area oxide capacitance, W is the channel width, and L is the channel length. a Vth and 
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ctp are standard deviations of V th and ft parameters. / a and / p are constants describing 
the influence of area and periphery fluctuations. 

Note that it is better to express the fi dependence as a function of crp/fi in percentage, 
so the absolute value of p is not important and the V gs — V* term can be expressed as 
Vg St . The equation can be rewritten in the following way, which is independent of the 
model parameters: 
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Figure 1 1.8 gives the comparison between prediction of the simple model and the mea- 
sured data of 3-sigma mismatching of the devices of 10-p.m channel width and 0.18-p.m 
channel length at various bias conditions. The model describes the trend of the 3-sigma 
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Figure 11.8 Comparison between the simple model and the measured 3-sigma mismatching of 
MOSFETs. Reproduced from Cheng Y. (2002) The influence and modeling of process variation 
and device mismatch for analog/RF circuit design. Proceedings of the Fourth IEEE International 
Caracas Conference on Devices, Circuits and Systems , pp. 282-289 
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mismatching of the device reasonably well by using the extracted values of op% and a v± 
from the measured data. 

11.3.3.2 A physical model 

The simple model discussed above is based on two parameters, used widely by designers, 
V t h and p. However, these two parameters are not independent process variables and 
depend on many physical process parameters, such as oxide thickness, flat band, channel- 
length offset, channel width offset, mobility, and doping concentrations in the channel, 
source/drain, and substrate. By using V* and p directly instead of using these fundamental 
process parameters, the correlations between V* and ft will not be accounted for physically 
in the model, so the simulated results may not predict correctly the device mismatching 
behavior, depending on the variation defined for V t h and p. Also, V gs mismatch is affected 
by these independent process parameters instead of just the V* mismatch, which is only 
a small contribution to the V gs mismatch in some cases. So a mismatching model based 
on the independent process variables is desirable. 

Recently, approaches based on the independent variables to model the mismatching 
have been reported (Zhang and Liou (2001); Oelun et al. (2000); Drennan and McAndrew 
(1999)). Since the SPICE models such as BSIM3 are implemented in simulators in a 
way such that the model equations, some of which include expressions with replaced 
correlations for the independent process variables by fitting parameters, have been hard- 
coded, it is difficult to reuse the correlations in statistical and mismatch simulation. So a 
subcircuit approach is adopted (Cheng (2002)), on one hand, to account for the important 
correlations of the totally independent process variables in the statistical/mismatching 
model, and on the other hand, to include the important components in RF applications 




Channel length (Urn) 

Figure 11.9 Model versus data for devices with different channel lengths. Reproduced from 
Cheng Y. (2002) The influence and modeling of process variation and device mismatch for ana- 
log/RF circuit design. Proceedings of the Fourth IEEE International Caracas Conference on Devices, 
Circuits and Systems, pp. 282-289 
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Figure 11.10 Measured data and model for devices with different channel widths. Reproduced 
from Cheng Y. (2002) The influence and modeling of process variation and device mismatch for 
analog/RF circuit design. Proceedings of the Fourth IEEE International Caracas Conference on 
Devices, Circuits and Systems, pp. 282-289 




Figure 11.11 Measured data and model for devices with different area (IV x L). Reproduced 
from Cheng Y. (2002) The influence and modeling of process variation and device mismatch for 
analog/RF circuit design. Proceedings of the Fourth IEEE International Caracas Conference on 
Devices, Circuits and Systems, pp. 282-289 
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such as gate resistance, substrate resistance, and so on. Both the influence of process 
variation and device mismatching has been considered in the model. For example, the 
parameters such as oxide thickness (T m ), doping concentrations in the channel (N c h), in 
pocket region (/V poc ketX i n source/drain region (/V sc i), in low-doped drain (LDD) region 
(Ni dd), in the gate (N gate ), in the substrate (N sa b), channel-length offset (A L), and channel 
width offset ( A W ) are selected as independent process variables. Some model parameters 
(in BSIM3v3) such as Vjho, K \ . K 2 , T^dsw* Cjo, , Cj SW g • /to, fk-oiir, N\x , 77 vt 1 , and so on 
are modeled as dependent variables linked to these independent parameters by the derived 
correlations in the subcircuit model. 

Figure 11.9 gives the comparison of the model and the data of devices with different 
channel lengths at two different bias conditions (in both linear and saturation). Similarly, 
Figures 11.10 and 11.11 show the comparison of the 3-sigma A/ d versus channel and 
area between the model and the data, respectively. To predict the influence of the process 
variation and device match on the device behavior, both the selection of independent 
process/device variables and the creation of the correlations linking the independent vari- 
ables and the device model parameters are important. For modern technology (0.13 pim 
or less), additional physical effects, becoming significant in advanced technology, should 
be considered in establishing the correlations. When considering most important physical 
effects and the correlations between the independent process variables, the model can 
predict the device mismatch behavior well for devices with different geometry at various 
bias conditions. 
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12 

Quality Assurance 
of MOSFET Models 



12.1 INTRODUCTION 

The present MOSFET models for analog circuit simulation (such as BSIM3, BSIM4, 
and Philips MOS Model 9) are all based on the one-dimensional (ID) theory initially 
developed for long-channel MOSFETs. To keep pace with technology, phenomenological 
modifications of the device models are necessary, resulting in a steady erosion of the 
physical basis and a plethora of model parameters of obscure origin. It is not uncommon 
today that the device models contain more than hundred parameters to take into account 
the many subtle mechanisms that govern the characteristics of deep submicron device 
structures. With this large number of parameters to play around with and if the model 
parameters are not extracted carefully and correctly, strange effects often surface in the 
modeled device characteristics, especially in the small-signal quantities such as transcon- 
ductance and channel conductance. This has become a major problem for the analog 
circuit designers that rely on precise modeling of the devices to accurately predict the 
behavior of their designs to ensure first-time-right silicon and to reduce the time-to-market 
for a product. 

An automated system has been implemented for quality assurance (QA) of MOSFET 
device models that was described for the first time in Risanger et al. (2000). Typically, 
the device models are supplied from the foundries. Upon arrival at the design house, 
the device models are run through a set of benchmark tests and qualified before circuit 
designers are allowed to use them. 

In this chapter we describe the various aspects of this QA system. We start out by 
giving a thorough motivation for the importance of such a system. Then we discuss some 
of the benchmark tests that are included in the QA system. Finally, we describe how the 
tasks of performing the tests are automated. 



12.2 MOTIVATION 

As mentioned above, the device models used today may contain more than hundred param- 
eters. Without careful parameter extraction and good methodology, the model parameter 
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sets produced by the automated characterization systems at the foundries may cause the 
models to behave strangely. This has become a major obstacle for the analog circuit 
designers that rely on reliable models to simulate their designs accurately and efficiently. 
In this section we will discuss the most severe problems encountered by design engineers 
that motivated the introduction of a QA system. 

One problem discovered is related to BSIM3v3.1 and its approach to binning. In 
Figure 12.1, we show the plot of modeled channel conductance gds versus gate length for 
a commercially available 0.5- pm CMOS process. The characterization of this particular 
process utilizes the built-in binning feature of the BSIM3 model. Binning is a model 
extraction approach in which the geometrical space is divided into regions, each having 
a separate model parameter set. A common problem with this approach is discontinuities 
of electrical characteristics across bin boundaries as shown in Figure 12.1. Note that this 
model even predicts a higher output conductance at a gate length of 2. 1 pm compared to 
a length of 1 .9 pm, which is obviously not the case for the real device. 

Another problem we have observed in the modeling of gds is shown in Figure 12.2. 
In this case, the model predicts a monotonically increasing output conductance with gate 
length for devices with gate lengths larger than 10 pm. This example is also from a com- 
mercially available 0.5-prn CMOS process, but the model used here was Philips MOS 
Model 9. 

Without proper model parameter extraction, the intrinsic gain ( g m /gds ) of a MOSFET 
predicted by BSIM3v3.1 may contain a nonphysical bump for gate-source voltages Vg S 
close to the threshold voltage. In the example shown in Figure 12.3, we have plotted 
the intrinsic gain versus V gs using a BSIM3v3.1 model parameter set for a commercially 
available 0.18-pm CMOS process. We notice a quite dramatic bump in the gain close 
to the threshold voltage. Such a behavior will certainly confuse a circuit design engineer 
who is looking for the optimum biasing for his/her amplifier design. 




Figure 12.1 Discontinuous channel conductance across bins for BSIM3v3.1. Reproduced from 
Risanger J. S., Raaum J., and Ytterdal T. (2000) Quality assurance of MOSFET models for analog 
circuit design. First Online Symposium for Electronics Engineers , September 2000 
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Figure 12.2 Nonphysical modeling of normalized output conductance X. Reproduced from 
Risanger J. S., Raaum J., and Ytterdal T. (2000) Quality assurance of MOSFET models for analog 
circuit design. First Online Symposium for Electronics Engineers, September 2000 




Figure 12.3 Nonphysical modeling of intrinsic gain. Reproduced from Risanger J. S., Raaum J., 
and Ytterdal T. (2000) Quality assurance of MOSFET models for analog circuit design. First Online 
Symposium for Electronics Engineers, September 2000 

12.3 BENCHMARK CIRCUITS 

Many of the tests implemented for MOSFET device models were originally described 
in Tsividis and Suyama (1994) and by the SEMATECH benchmark circuits described 
by the Compact MOSFET Council Web page at http://www.eigroup.org/cmc/ . Since the 
introduction of the QA system, the number of tests implemented has grown to cover new 
features and new problems encountered in new technologies released for the commercial 
market. The following tests have been implemented: 

• Drain current characteristics 

• Transfer characteristics in weak and moderate inversion 
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• Transconductance to current ratio 

• Channel conductance 

• Temperature dependency of junction leakage currents 

• Charge conservation 

• Non-quasi-static operation 

• Thermal noise 

• Flicker noise 

• Channel conductance versus channel length and width 

• Intrinsic gain (g m /g As ) 

• Gate leakage current. 

Selected tests are discussed in detail in the following sections. 



12.3.1 Leakage Currents 

With the continuous downscaling of the line widths in integrated circuit processes, the 
MOSFET leakage currents eventually dominate the current consumption of low power 
integrated circuits. Accurate modeling of the leakage currents at high and low temperatures 
are thus of vital importance for designing complex mixed-mode circuits having current 
consumption levels in the sub-microampere regime. 

To reveal the leakage currents modeled in MOSFET device models, the test setup 
given in Figure 12.4 could be utilized. Here, an n -channel transistor is biased at a gate- 
source voltage of zero and a drain- source voltage equal to half of the power supply 
voltage. The drain current 7 d is then calculated versus temperature. It is also possible 
to split the drain current into a drain-source 7 ds and a drain-bulk 7 db current to be able 
to study the two contributions individually. To illustrate qualitatively correct leakage 
current characteristics, we have simulated a state-of-the-art 0.13-p.m «MOS transistor in 
a commercially available process. The width of the device was 0.8 p,m. A qualitatively 
correct plot of the drain current is shown in Figure 12.5 and the corresponding plot of 
the individual contributions to the drain current is shown in Figure 12.6. 



G 




Figure 12.4 Test setup for calculating the leakage currents 
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Figure 12.5 Qualitatively correct drain current of an n -channel MOS transistor having a 
gate-source voltage of OV. The threshold voltage of this process is about 0.4 V 




Figure 12.6 Drain-source 7 ds and drain-bulk 7 db leakage currents versus temperature 

We note from Figure 12.5 that the temperature dependency of the drain current is 
close to exponential in the entire temperature range. To reproduce a similar plot in your 
circuit simulator, the option GMIN must be set to a very low value, much lower than its 
default value of 10 12 S. To check that the leakage of the bulk diodes is properly modeled 
versus temperature, a plot of 7 db as the one shown in Figure 12.6 should be produced by 
the model. 

12.3.2 Transfer Characteristics in Weak 
and Moderate Inversion 

As the power supply voltages are scaled down from one technology generation to the next, 
the gate voltage overdrives used in analog CMOS circuits keep shrinking. Thus, it becomes 
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increasingly important to accurately model the moderate and weak inversion regions. 
This test focuses on the three following issues: inspection of the moderate inversion 
region, check for proper inclusion of the drain-induced barrier lowering (DIBL) effect 
(see Section 1.5. 2. 2), and assure correct modeling of the body effect. 

The DIBL is recognized as one of the short-channel effects. Thus, all models describing 
typical short-channel MOSFETs should be tested. DIBL clearly influences both the leakage 
current and the operating current of the MOSFET device. The test setup is shown in 
Figure 12.7. This test should be run twice, once for investigating the DIBL and once for 
investigating the body effect. 

In the simulation of the DIBL effect, the body and the source terminals are shorted. In 
Figure 12.8 a qualitatively correct plot is shown. The drain current clearly increases with 
the drain voltage owing to the induced shift in the threshold voltage. 

12.3.3 Gate Leakage Current 

As the CMOS technologies are scaled down into the very deep submicron regime, the 
gate oxide thickness becomes thinner and thinner. As a consequence, the gate current is 



L> 




Figure 12.7 Test setup for simulating weak and moderate inversion characteristics 




Figure 12.8 Simulated transfer characteristics for a deep submicrometer hMOS with a gate length 
of 0.13 |xm 
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h 




Figure 12.9 Test setup for simulating gate leakage current 




Gate-source voltage (V) 

Figure 12.10 Simulated gate current versus gate-source voltage. The oxide thickness was 3 nm 

increased considerably. At the 0. 1 3-|im technology node, the static gate current is on the 
order of nanoamperes. Thus, in the design of low-power circuits, this current component 
can no longer be neglected and proper modeling of the gate current characteristics becomes 
important. On the basis of this, we have recently defined a gate leakage current test. The 
setup of this test is shown in Figure 12.9. 

As shown in Figure 12.9, all terminals of the transistor except the gate are connected 
to ground. The gate-source voltage is swept from zero to the maximum value allowed for 
the process. An example of the gate voltage dependency on the gate current is shown in 
Figure 12.10. The simulated results shown in the figure were performed at room temper- 
ature. Since the gate current is dominated by tunneling, the temperature dependency is 
very weak. 



12.4 AUTOMATION OF THE TESTS 

The benchmark tests have been implemented at Nordic VLSI (www.nvlsi.com) using 
the command language called SimPilot and are performed automatically every time a 
new MOSFET model parameter set is received from a foundry. A schematic overview 
of the system implementation is given in Figure 12.11. SimPilot connects directly to the 
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Figure 12.11 Overview of the implementation of the automated system. Reproduced from 
Risanger J. S., Raaum J., and Ytterdal T. (2000) Quality assurance of MOSFET models for analog 
circuit design. First Online Symposium for Electronics Engineers , September 2000 

commercial circuit simulator Eldo and produces output files adapted for the graphical 
post-processor Xelga 1 . In addition to the command script hie, SimPilot must be loaded 
with the actual MOSFET device model parameter hie and a conhguration hie holding 
information about device geometries, temperatures, voltages, and so on. The command 
script hie has a syntax almost identical to that used in UNIX scripts with the addition 
of numerous postprocessing commands specihc to SimPilot. The command script may be 
executed in batch mode for efficient computer resource utilization. Although the script is 
considered static, it is easy to expand it with new functional tests if necessary. However, 
the user settings given in the conhguration hie have to follow a predehned setup. 

The graphical output hies created by SimPilot are given names in accordance to the 
present benchmark test and are observed visually to detect possible shortcomings in the 
MOSFET device model under test. Finally, a report hie containing the test results and 
important notes is generated and published online on the Intranet to make the results 
available to the circuit designers. 
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subthreshold, 30 

Current-voltage characteristics, 17, 25 
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Depletion approximation, 23 
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Depletion layer, 8 
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width. 9 

Depletion regime, 4 
DIBL, 29, 31, 35 
parameter, 35 
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calculation, 149 
components, 142 
Double-gate FET, 37 
Drain, 1 
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Drain induced barrier lowering, 29, 35 
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203, 204, 208 
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FET, 1 
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Field effect transistor, 1 
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Gate, 1 
charge, 40 
leakage, 28, 36, 284 
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Gate dielectric model, 153, 155 
Gate resistance model, 84, 192 
Gate voltage overdrive, 35 
Gate tunneling current model, 

176, 198 
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Generation mechanisms, 9 
Generation time, 10 
Geomod. 172, 202, 203 
Gradual channel approximation, 15 

Harmonic amplitudes, 143 
Harmonic balance, 150 
Harmonic distortion, 142 
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HB, 150 
HD2, 142 
HD3, 142 

Heat dissipation, 33 
Heterostructure FET, 1 
HF noise parameters, 128 
HFET, 1, 30 
High-field effects, 31 

channel length modulation, see CLM 
velocity saturation, 212 
mobility degradation, 213 
Holding time, 30 
Hot-carrier effects, 32 
Hot electron(s), 32 
emission, 32 

IC, 1 

Ideality factor. 12 
Impact ionization, 28, 32 
Induced charge, 18 
Induced gate noise, 133, 134 
Inductors, 260 
Injection barrier, 29 
Insulator, 6, 7, 37 
capacitance, 6 
high-k, 37 
permittivity, 7 
Interface, 32 
barrier, 32 
states, 32 

Integrated circuit, 1 
Integrated resistors, 265 
Intrinsic capacitance, 21 
Integrated inductors, see inductors 
Integrated spiral inductor, see spiral 
inductor 

Intermodulation distortion, 143 
Inversion, 4, 8, 11, 13, 19 
channel, 13 
charge, 11, 19 
moderate, 8 
regime, 4 
strong, 4, 8 

JFET, 1 
Junction 
depth, 34 
FET, 1 

Junction capacitors, 258 

Junction diode capacitance model, 200 

Junction diode models, 198 

Kirchhoff s current law, 41 
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Lateral flux capacitor, 257 
LDD, 32 

Leakage current, 27, 37 
Local oxidation of silicon, 57, 65 
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Low-doped drain. 32 
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Metal resistors, 252 
Metal-semiconductor FET, 1 
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Minimum noise figure, 128, 129, 133, 
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Minority carriers, 3 

Mismatching model of capacitors, 271 
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Mobility, 17, 28, 31, 33, 34, 37 
degradation, 213 
field-effect, 28, 33, 34. 37 
fluctuation model, 121 
gate bias dependence, 31, 33 
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Mobmod, 167, 168 
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Moore’s law, 36 
MOS capacitance, 8 
MOS capacitor, 2, 8 
characterization, 11 
C-V characteristics, 11 
band diagram, 4 
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MOS varactors, 62 
MOSA1 model, 235 
MOSFET, 1, 13. 15, 26, 37, 40 
advanced modeling, 27, 37, 42 
capacitors, 257 
equivalent circuit, 9, 27, 41 
LDD, 32 

leakage currents, 282 
long channel, 14. 16 
narrow channel, 35 
;7-channel, 1, 14 
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MOSFET, ( continued ) 
nonlinearites, 145 
nonlinearity coefficients, 146 
operation, 13 
p-channel, 1 
simple modeling. 15 
small-signal model, 27 
SOI, 32 

square law model, 145 
MOSFET capacitance(s), 21, 22, 26, 37, 
40, 257 
flat-band. 24 
gate-channel, 37 
gate-drain, 22, 38 
gate-source, 22, 38 
gate-substrate, 22 
intrinsic, 21 
junction, 21 
lumped, 21 

Meyer, 22, 23, 26, 38, 40 
model, 21, 37 
oxide. 23 
subthreshold, 24 
Ward Dutton, 22, 40 
MOS structure, 1 

Multidimensional nonlinear conductance 
and transconductance, 143 
Multidimensional nonlinear resistance 
and transresistance, 143 

Narrow channel FET, 35 
Noise model(s), 120, 121, 126, 219, 233 
Noise parameters, 128, 131, 132, 134 
Noise sources, 119, 138, 194, 233 
Nonideal effects, 11, 27, 31 
Nonlinear capacitance, 144 
Nonlinear conductance, 143 
Nonlinear distortion, 142 
Nonlinear transcapacitance, 144 
Nonlinear transconductance, 143 
Nonlinear transresistance, 143 
Non-quasi-static 
behavior, 98 
modeling, 42, 218 
Non-uniform lateral doping, 157 
Non-uniform vertical doping effect, 153 
NQS, 42, 190, 218 
«-well CMOS process, 56 

Ohmic contact, 1 

Overlap capacitances, 21, 26, 59, 77, 95, 
98, 180, 233 



Oxidation process, 57, 58 
Oxide-semiconductor interface, 2 
Oxide, 23, 32 
capacitance, 23 
charges, 32 

Oxide, effective thickness, 36, 37 

Parameter extraction, 74, 75, 78, 83, 148 
Parasitic bipolar transistor, 243 
device structure, 243 
modeling, 243 
nonideal effects, 246 
Parasitic capacitances, 95, 252, 253 
Parasitic resistances, 74, 75, 88, 170, 

197, 225 

Permittivity, 5, 7 
Permod, 202 

Philips MOS Model, 9, 223 
Photocurrent, 32 
Photon, 32 
absorption, 32 
emission, 32 
Pinch-off, 14. 16, 17, 19 
pMOS transistor, 56, 121 
Poisson’s equation, 5, 15 
Poly-poly capacitors, 255 
structure, 255 
equivalent circuit, 256 
Poly resistors, see polysilicon resistors 
Polysilicon gate, 37 

Polysilicon resistors, 60, 64, 65, 271, 253 
model, 253 

equivalent circuit, 253, 254 
Potential offset parameter, 154, 165 
Power dissipation, 33 
Process flow, 48, 49 
Process variation, 263, 264, 266 
Punch-through, 27 
p-well CMOS process, 56 

Quality assurance of models, 279 
Quasi-Fermi potential, 30 
Quasi-static assumption, 42 

Radio frequency, 26 
Rbodymod, 194 
RC network, distributed, 42 
RCSE, 214 

Rdsmod, 170, 171, 172, 175 
Recombination mechanisms, 9 
Relaxation time, charge, 43 
Reliability, 37 
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Resistivity, 249 
Resistors, 249 

Reverse short-channel effect, 214 
RF, 26 

model, 69, 71, 192, 193 
Rgatemod, 192, 193 
Rgeomod, 172, 204, 205 

Saturation, 14, 17, 19 
current, 17 
field, 19 
velocity, 19 
voltage, 14, 16, 19, 20 
Scaling, 27, 36 
CMOS, 27 
device, 36 
Scattering, 33 
SCCM, 11, 16, 25 
Second harmonic, 142 
Second-order intermodulation products, 
143 

Second-order nonlinear circuit behavior, 
142 

Self-alignment, 13 
Self-capacitances, 41 
Self-heating, 32 

Semiconductor-oxide interface, 3 
Semiconductor, 5, 7, 8 
capacitance, 8 
permittivity, 5, 7 
Series resistances, 31 
Shallow trench isolation, 57. 161, 264 
Shooting method, 150 
Short-channel effects, 21, 27, 34, 148. 

157, 158, 160 
Shot noise model, 198 
Silicide-blocked resistor, 65 
Silicon dioxide, 1, 2, 33, 36, 37 
Silicon-on-insulator, 1, 33 
Single-frequency excitation, 142 
Si0 2 , 1, 33 
Skin depth, 250 
Small-signal, MOSFET, 26 
model, 26 

equivalent circuit, 27, 233 
Snapback breakdown, 32 
SOI, 1, 32 
Source, 1 
charge, 40 

SPICE, 15, 26, 29, 31, 33, 42, 43, 149 
example, 214 



Spiral inductors, 260 
structure, 261 
equivalent circuit, 261 
model, 262 

Steady state analysis, 150 
Strong inversion, 4 
onset of, 7 
Substrate current, 32 
models, 179 
Subthreshold, 8, 12, 24 
capacitance, 24 
current, 30 
ideality factor, 12 
regime, 8 
Surface, 6, 28, 33 
electric field, 6 
nonuniformities, 33 
potential, 5, 8, 23 
states, 8 

Symbolic methods, 150 
Symbolic network analysis programs, 
151 

TFT, 30 

Technology node, 1, 27, 37 
Temperature dependence, 33 
THD, 142 
Thermal 

conductivity, 33 
resistance, 33 
voltage, 5, 16 

Thermal noise model, 126. 127, 196, 
197, 211 

Thermionic emission, 29 
Thin film transistor, 30 
Third harmonic, 142 
Third-order nonlinear circuit behavior, 
142 

Threshold condition, 9 
Threshold voltage, 7, 13, 16, 22, 32, 35 
model, 157, 159, 161, 163, 226 
temperature dependency, 219 
Tnoimod, 196, 197 
Total harmonic distortion, 142 
Transcapacitances, 40 
Transconductance, 18 
Tmqsmod, 190, 193 
Tunneling, 4. 27, 32, 37 

UCCM, 12, 30, 35, 235 
Unified charge model, 164, 235 
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Unified C-V model, MOSFET, 37 
Unified MOSFET I-V model, 237 
Uncertainty principle, 37 

Valence band, 4 
Varactor, 60, 62, 63, 258 
Velocity saturation, 212 
Velocity saturation model, 19, 25, 237 
Velocity-field relationship, 19 
Sodini model, 19 
two-piece model, 19 
Vertical FET, 37 



Vertical metal-insulator-metal (VMIM) 
capacitors, 62 
VLSI, 1 

Volterra series, 150 

Voltage saturation, 14, 16, 19, 25 

Vth roll-off, 158, 159, 161 

Ward-Dutton capacitances, 22, 40 
Weakly nonlinear, 150 
Well resistors, 251 

equivalent circuit, 252 
Work function, 3 




